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Introduction 


Digital Literary Studies in the United States 


It was also interesting to see, during the convention and after, 
a debate among the Twitter crowd about the label “digital 
humanities" and whether it was accurate or useful and how 
to get humanists, digital and otherwise, to talk more (or more 
usefully) to one another. A catchall phrase comes in handy— 
it’s hard to imagine the NEH's establishing an Office of Cool 
Scholarship Done With Digital Tools—but it doesn’t do jus- 
tice to the very different kinds of work done under that label. 
Maybe the term is just a placeholder, and the day is not far 
off when people won’t feel the need to make a distinction 
between the humanities and the digital humanities. 


—Jennifer Howard, “The MLA Convention in 
Translation,” The Chronicle of Higher Education, 
December 31, 2009 


Scholars who self define as digital humanists joke that any public 
discussion on digital humanities will inevitably turn to the ques- 
tion: “What are the digital humanities?” Digital humanists spend 
what seems to be an inordinate amount of time discussing, defin- 
ing, and explaining what, in many ways, is an amorphous, fluid 
area of study. Books, articles, blog posts, tweets, conferences, and 
conference papers that define “digital humanities” have grown ex- 
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ponentially, and so common is the query that Matthew Kirschen- 
baum has called such “what is” essays “genre pieces.”! While the 
digital humanities as an area of scholarly inquiry might appear to 
be a recent invention, utilizing computing technology to answer 
humanistic questions is often dated to 1946 and Father Busa's In- 
dex Thomisticus, a concordance program.’ Early digital work was 
conducted on mainframe computers using punch cards or paper 
tape and focused on concordance development, authorship stud- 
ies, and linguistic analysis. As digital technology applications for 
humanities materials developed in the 1980s and 1990s, driven in 
large part by increased use of microcomputing and the emergence 
of the World Wide Web, scholars adopted the term humanities 
computing to describe their work. By the early 1990s, humani- 
ties computing was well established with growing numbers of 
humanities computing departments, centers and institutes, spe- 
cialized journals, an annual conference, and three distinct schol- 
arly organizations: the Association for Literary and Linguistic 
Computing, the Association for Computers and the Humani- 
ties, and the Society for Digital Humanities/Société pour l'étude 
des médias interactifs. Focused on “information technology as a 
tool and written texts as a primary object of study (for linguistic 
analysis), according to Patrik Svenson, humanities computing 
was a cohesive scholarly pursuit. The World Wide Web (web or 
WWW), however, would change everything and set the stage for 
the current tensions in the field. In her comprehensive history of 
humanities computing, Susan Hockey argues that at the onset of 
the web, *some long-term humanities computing practitioners 
had problems in grasping the likely impact of the Web in much 
the same way as Microsoft did.”* In Hockey’s analysis, scholars 
saw the web as a space devoid of serious activity and unable to 
support scholarship, but she also, and perhaps more importantly, 
predicts the fissures in humanities computing that would explode 
in contemporary conflict. Instead of ensuring a cohesive humani- 
ties computing community, who shared a good deal of agreement 
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on technique and methodology, the web spurred a new set of us- 
ers who exploited the web’s flexibility and openness to diversify 
scholarly questions and methodologies, often viewed as a direct 
assault on scholarly rigor and exclusiveness. 

By the mid-2000s, humanities computing was declining as a 
term of choice, with Willard McCarty’s Humanities Computing, 
published in 2005, signaling the last substantive use of the term? 
During the same year, the Association for Literary and Linguis- 
tic Computing, the Association for Computers and the Humani- 
ties, and the Society for Digital Humanities/Société pour l'étude 
des médias interactifs merged into the Alliance of Digital Hu- 
manities Organizations (ADHO), and humanities computing be- 
came digital humanities (DH), the term that John Unsworth had 
coined for the 2004 Blackwell Companion to Digital Humanities. 
Regardless of the acceptance and use of the term digital humani- 
ties, a working definition remains elusive even to those that call 
themselves digital humanists. It is not clear whether the digital 
humanities are a field, a technique, or a trend, or if such defini- 
tions are antithetical to the scholarly project. Scholars argue over 
whether the digital humanities should emphasize digital building 
or theorizing and contest hack versus yack. The metaphor of the 
big tent, where all those interested in scholarly uses of technol- 
ogy might reside, continues to be a point of contention, leaving 
some, such as Matthew Kirschenbaum, to see the term digital 
humanities as tactical rather than informational. 

'This book is written as the digital humanities become increas- 
ingly visible, with articles about DH appearing in the Chronicle of 
Higher Education, increasing numbers of jobs posted in the MLA 
job list, and a growing number of DH centers across the United 
States. Yet at disciplinary conferences, college and university 
meetings, in social media, and in trade publications, such as Inside 
Higher Ed, the “what is DH” question continues to be voiced. 
This monograph does not seek to provide one definition of digi- 
tal humanities. Instead, the project suggests that digital humani- 
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ties is, in many ways, a living term, ever evolving, ever shifting in 
response to particular pressures of scholarship, the academy, and 
the individual. Accordingly, the project traces the various theo- 
retical and methodological branches of literary digital humani- 
ties to reveal how seemingly unrelated literary movements have 
shaped current digital humanities practice. Many of the early 
books on digital humanities have focused on the breadth of the 
digital humanities, arguing that digital humanities is an inclusive 
form that is able to be all to all fields. While such a tactic serves 
the political purpose of making digital humanities indispensable, 
it obscures the impact of practitioners from various disciplinary 
backgrounds who have shaped technology to address their schol- 
arly investigations. This book responds to the need for a coherent 
and focused analysis of the impact of discipline on the emergence 
of digital literary studies. I hope that this approach leaves the 
way open for others to think through digital practices in related 
areas such as game studies, new media studies, postcolonial stud- 
ies, history, architecture, information studies, computer science, 
language studies, and archaeology. 

I limit this study to the American academy, though I am fully 
aware that digital humanities in America did not develop in a 
vacuum. Professional organizations cross national boundaries, 
scholars move to jobs in different countries, and ideas are shared 
internationally. Yet formulations of digital work are constructed 
by national contexts shaped, in part, by funding and reward sys- 
tems. Many digital projects in the United States, for example, 
have been supported by grants from the National Endowment 
for the Humanities (NEH) Start Up program. Focused on tech- 
nological innovation, NEH Start Up grants have spurred the 
production of interoperable digital tools and technological stan- 
dards. In other countries, such as Canada, larger digital research 
budgets and larger grant payouts have encouraged the pursuit 
of broader projects with huge collaborative teams. Additionally, 
reward systems have impacted the types of projects that scholars 
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are willing to pursue. In countries where grant monies are nec- 
essary to secure tenure and promotion, digital humanities tends 
to be more prominent than in countries where grant funding is 
not valued. Digital projects that emphasize outreach and public 
impact are increasingly the norm in places like the United King- 
dom, where funding models are driven by measurements that 
emphasize such criteria. Given the influence of the localized aca- 
demic environment on the formation of digital humanities, it is 
pertinent to examine the practice within a particularized context. 

In addition to situating digital humanities in its appropriate 
academic and national context, this project seeks to locate dis- 
ciplinary influences on the construction of digital humanities. 
There is no doubt that the broader term digital humanities en- 
compasses multiple areas of scholarly inquiry—from literary stud- 
ies, to linguistics, to classics, to history and more— but the reality 
of the situation is that the institution that fuels scholarship—the 
academy—has not made much progress away from traditional 
disciplinary structures. The impact of interdisciplinary groups, 
departments, and scholarship is growing, but most scholars con- 
tinue to be trained and practice in a disciplinary manner. Traces of 
tbe Old, Uses of tbe New: Tbe Emergence of Digital Literary Studies 
grapples with these crucial issues by tracing the historical devel- 
opment, theoretical roots, and emergent trends of what is now 
being called digital humanities within literary studies. Conflicts 
within the larger digital humanities are revealed to be driven by 
long-held disciplinary understandings of approaches, methodol- 
ogies, and values. Fields *do" scholarship differently. Digital hu- 
manities scholars have long operated under the false conception 
that new technological approaches and collaborative research 
negate the particularities of disciplinary training. This project 
seeks to expose the naturalized assumptions of interdisciplinarity 
in digital humanities. 

A number of early published volumes discuss digital humani- 
ties, such as the Blackwell Companion to Digital Humanities, Black- 
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well Companion to Digital Literary Studies, Willard McCarty’s 
Humanities Computing, and Jerome McGann's Radiant Textuality: 
Literature after the World Wide Web.* In such a rapidly changing 
area of inquiry, these works, which have served us well, are be- 
coming dated. Luckily we are seeing an explosion of volumes 
focused on defining digital humanities, including Understanding 
Digital Humanities, edited by David M. Berry; Steven E. Jones's 
The Emergence of Digital Humanities; Literary Studies in the Digital 
Age: An Evolving Antbology, edited by Kenneth M. Price and Ray 
Siemens; Digital Humanities, edited by Anne Burdick et al.; De- 
fining Digital Humanities: A Reader, edited by Melissa Terras et al., 
among others.’ Some digital humanities volumes have focused 
on specific issues related to digital humanities, as do Susan Hock- 
ey's Electronic Texts in the Humanities: Principles and Practice; Elec- 
tronic Textual Editing, edited by Lou Burnard, Katherine O'Brien 
O'Keeffe, and John Unsworth; and Dan Cohen’s Hacking the 
Academy; conflicts within the field, such as Debates in tbe Digital 
Humanities, edited by Matthew K. Gold; or specific techniques of 
analysis, such as Matthew L. Jockers's Macroanalysis: Digital Metb- 
ods and Literary History. © Perhaps unique in digital humanities is 
that print scholarship is beginning to wield less power in shap- 
ing the area as blog posts, tweets, listserv discussions, and digi- 
tal projects gain attention. Alan Liu’s influential post “Where Is 
Cultural Criticism in the Digital Humanities?,” John Unsworth’s 
early “What Is Humanities Computing and What Is Not?," Mat- 
thew Kirschenbaum's “What Is Digital Humanities and What's 
It Doing in English Departments?,” or Bethany Nowviskie’s illu- 
minating *Eternal September of the Digital Humanities" have all 
made crucial interventions in digital humanities." As the digital 
humanities writ large is shaped by a growing body of criticism, 
the exploration of specialized inquiry areas gains momentum. 
In addition to the above essays, full-length volumes including 
Daniel Cohen and Roy Rosenzweig's Digital History: A Guide to 
Gatbering, Preserving, and Presenting tbe Past on tbe Web and my 
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coedited volume The American Literature Scholar in the Digital Age 
become a necessary means of contouring our understanding of a 
dynamic area of scholarly inquiry. 

Traces of the Old, Uses of the New: The Emergence of Digital Liter- 
ary Studies analyzes the emergence of digital literary scholarship 
over the last 25 years. This project uses the scholarship and prod- 
ucts of the digital turn to define historical and emergent trends; I 
analyze a range of materials including digital editions, digital ar- 
chives, etexts, scholarly writing, digital artifacts (including tools 
and metadata), and interviews with key players in the field. While 
some critics have argued that digital humanities are an outlier to 
literary studies, this project reveals that many of the theoretical 
elements of literary studies are retained in digital literary studies. 
My project defines and analyzes four dominant areas of work in 
what I call digital literary studies: the digital edition form, the 
digital archive form, cultural studies approaches, and literary data 
approaches. I define each of these areas as foundational for digital 
literary studies and argue that these forms function within a con- 
tinuum of production, with new techniques, such as datamining, 
gaining prestige within the field while never fully eliding earlier 
practice, such as the digital edition. 

In chapter 1 I trace the foundational form of digital literary 
production, the digital edition. “The Rationale of Holism: Tex- 
tual Studies, the Edition, and the Legacy of the Text Entire” ar- 
gues that the centrality of the digital edition form that emerged 
from the combative field of textual studies transferred key ideas 
regarding texts and materiality to digital literary studies. Key 
concepts examined in the chapter include a distrust of the digital 
environment, the holistic text, and the desire for editorial control 
of the text. While textual studies has given digital literary studies 
an infrastructure through which we might represent the material 
text, our textual studies roots have also transferred the unfortu- 
nate legacy of the unfair representation of editing as uncritical 
and mechanical. Further, the editorial emphasis on purity and 
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the inherited problematic treatment of issues of diversity has im- 
pacted the way in which digital literary studies has selected ma- 
terials to digitize. Textual studies work has not neatly transferred 
into the digital nor has textual studies remained the dominant 
mode within digital literary studies, but the impact of textual 
studies on the field is undeniable. 

Chapter 2, “The Era of the Archive: The New Historicist 
Movement and Digital Literary Studies," tracks the archive fever 
that overtook digital humanities in the 1990s, arguing that the 
digital archive was a contradictory form that sought to create an 
idealized archive. Work by Jerome McGann, Kenneth M. Price, 
Alan Liu, Martha Nell Smith, and others is examined to deter- 
mine how specific tenets of new historicism, such as the use of an 
anecdote within a complex social system, form the digital archive 
model. Using examples from digital archives and print scholar- 
ship, I argue that the digital archive imagines the text within an 
expansive yet holistic system, with the textual materials designed 
to interact with a wide range of cultural materials. Tracking the 
rise of open access, web delivered archives, the chapter reveals the 
self-reflexivity of archive construction with particular attention 
to TEI/XML encoding approaches. Examination of theorists in- 
cluding Clifford Geertz, Jacques Derrida, and Michel Foucault 
exposes the impact of new historicist thinking on digital liter- 
ary studies treatment of power structures, canon, and apparatus. 
The archival turn in American digital literary work has created a 
theoretical foundation for digital literary studies and allowed for 
the development of standardized approaches. I end the chapter 
by examining the growing tension between digital literature and 
digital history over the treatment of archives and argue that this 
tension is a prime example of the difficulty scholars have in defin- 
ing the umbrella term of digital humanities. 

Chapters 3 and 4 highlight emergent trends in digital liter- 
ary production. Chapter 3, "What's In and What's Out?: Digi- 
tal Canon Cautions,” charts the impact of cultural studies ap- 
proaches on digital humanities. Examining what I call digital 
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recovery projects, the chapter focuses on activist, small-scale 
projects that used digitization to expand what such scholars saw 
as an outmoded new critical literary canon that excluded work 
by women, people of color, queers, and others. Embedded within 
contemporary understandings of the Internet, projects utilized 
entry-level technology skills to produce digital archives and cu- 
rated, hyperlinked sites to digitize texts that would expand the 
canon. Cocurrent with individual scholarly production was the 
heyday of etext centers, focused on producing a large volume 
of digitized cultural materials. Examining early digital recovery 
projects including Alan Liu’s Voice of the Shuttle, Jean Lee Cole’s 
The Winnifred Eaton Digital Archive, the Women Writers Proj- 
ect, Glynis Carr’s The Online Archive of Nineteenth-Century U.S. 
Women’s Writing, and others, the chapter uncovers that not only 
has the early wave of small recovery projects slowed but projects 
have begun to disappear. I interrogate the impact of infrastruc- 
ture, community, technological standards, and economics on the 
construction of digital literary canons, providing a roadmap for 
the construction of a broader digital literary canon. 

Chapter 4, “Data and the Fragmented Text: Tools, Visualiza- 
tion, and Datamining or is Bigger Better?,” focuses on tool de- 
velopment, visualization, and datamining, three crucial subareas 
of the interpretive bent of digital studies. Current work on visu- 
alizing and datamining is examined in the chapter, with careful 
attention to optical character recognition (OCR) and data sets. 
The chapter argues that there is an unresolved and longstanding 
division between interpretive and representational uses of tech- 
nology within digital literary studies, particularly in the develop- 
ment of tools. Scholars interested in constructing tools to sup- 
port digital work recognize that tool development is expensive 
and difficult and often leads to highly idiosyncratic, nonexten- 
sible, and unsustainable tools. The alternative, generalized tools, 
raises questions of use value, as many tools are not designed to 
address humanities concerns. Examining a variety of tools, such 
as Wordseer, Juxta, and the Versioning Machine, and datamin- 
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ing projects, such as Lauren Klein’s work on James Hemings 
and Matthew Wilkens's analysis of Civil War American fiction, 
I argue that while such approaches are in their infancy, the pos- 
sibilities are myriad. Instead of rejecting algorithmic approaches 
as flawed, we must focus on the construction of data in tandem 
with experimental algorithmic manipulations. 

In the book's final chapter, “Notes on the Future of Digital 
Literary Studies," I consider the ways in which digital literary 
studies might come to terms with its history. Rejecting the hack- 
neyed “what are the digital humanities" genre, the chapter re- 
visits each of the formative fields discussed in the book to spec- 
ulate how we might address current conflicts within the field. 
The chapter examines the current contours of debate in digital 
literary studies, with particular attention to formations of inside/ 
outsider and resistance to the field by traditional literary scholars. 
With attention to current flash points of digital literary studies 
including the hack/yack divide, concerns regarding inclusivity, 
and the so-called innate conservatism of digital humanities, the 
chapter calls for a return to activist digital innovation that is di- 
vergent, not convergent. 

It is crucial that scholars map the field of digital literary stud- 
ies. My analysis of the development of digital literary studies itself 
owes much to new historicism, which theorizes that movements 
should be contextualized within power structures and examines 
the impact of theoretical, economic, social, and historical impacts 
on scholarship. By examining the evolution of what we have come 
to call digital literary studies within such contexts, we might bet- 
ter understand how the form has both shifted and maintained 
certain conceptions of text, literature, and scholarship. It is my 
hope that this volume will encourage interest in the emergence 
of digital scholarship and that others will choose to map technol- 
ogy applications within their area of expertise. 


CHAPTER I 


The Rationale of Holism 


Textual Studies, the Edition, and the 
Legacy of the Text Entire 


For two millennia, the principal storage mechanism for the 
world's intellectual memory took the form of manuscript and 
printed books. These days, students and scholars have avail- 
able to them a rapidly growing influx of digitized material, 
and the internet offers enormous possibilities for increasing 
the use of scanned older materials by making them more 
broadly available than would ever have been possible in a 
print environment. But we cannot provide posterity with an 
electronic copy of Walt Whitman's Leaves of Grass and, by so 
doing, absolve ourselves of the responsibility for preserving 
copies of the original, early, printed editions of Whitman's 
book and the manuscripts that lie behind them. We bave no 
right to deprive tbe future of the past. 
—Pamphlet, Rare Book School, the University of 
Virginia, 2009 


Digital editions are some of the most visible early digital projects 
in digital literary studies, so predominant that one might argue 
that the digital edition is the primary form of the first genera- 
tion of the field. For the purposes of this study, digital editions 
are those projects that meet the MLA “Guidelines for Editors of 
Scholarly Editions," and, as such, present a reliable text estab- 
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lished by accuracy, adequacy, appropriateness, consistency, and 
explicitness.! Emerging from textual studies,’ such “work explores 
the ideological structures and material processes that shape the 
transmission, reception, production, and interpretation of texts.” 
By probing the combative field of American textual studies in the 
1990s, this chapter will reveal the roots of practices that are now 
accepted as standard in digital literary studies, such as the focus 
on digital editing and widely accepted models of form and layout 
of digital materials. These representations emerged out of what 
I call a ^whole text" approach, a cohesive print-to-digital model 
that features interrelated textual materials, often in print book 
form, rather than an expansive and fragmented representation 
of text, as is increasingly the case with data-based deformations. 
'The digital edition privileges the structure of the book, which 
is viewed as a self-contained entity with a naturalized means of 
displaying knowledge and is replicated in most aspects of digital 
edition creation, from display to the treatment of data. The rep- 
lication of print in a digital form is designed to increase access 
to materials and aid examination of aspects of the original (il- 
lustrations, typography, etc.) that is rarely possible in modern re- 
prints. While there is no doubt that such materials are beneficial 
to scholars, the early period of digital editions did not provide 
proof for the claim that digitization allows scholars to ask and 
answer questions in new ways, one crucial argument for the sup- 
port of digital literary studies. In addition to this limitation, our 
textual studies roots brought the unfortunate marginalized status 
of editing to the greater digital literary studies, reinforcing an 
outsider position for our work that scholars continue to battle, 
and reinforcement of a traditional canon. 

It is not an accident that textual studies scholars were intrigued 
with the digital in the early 1990s. Literary editorial scholarship 
was in upheaval.* Conflict within—authorial intent versus the so- 
cial construction of the text—and without—the devaluation of 
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editorial work by the larger discipline—made the field extremely 
unstable. As editors began to search for a way to create a better 
edition and to reinsert editing into the core of literary studies, 
they began to consider digital technologies as a possible help- 
mate.’ This chapter is not intended to serve as a history of schol- 
arly editing but to reveal how the history of scholarly editing has 
impacted the way that digital literary studies represents texts and 
to demonstrate that the conflict within textual studies approaches 
has, in some ways, seeded contemporary tensions regarding digi- 
tal texts. 

1951 marked the beginnings of the rationale papers, the es- 
says that textual scholars wrote to advocate varying practices in 
the field. The 1951 “Rationale of Copy-Text” by W. W. Greg 
launched what came to be known as the Greg-Bowers model, 
“the dominant mode of Anglo-American textual criticism, insti- 
tutionally and academically” and which dominated the field un- 
til challenged in the 1980s.° Greetham describes this school as 
“the copy-text school of eclectic editing designed to produce a 
reading clear-text whose features were a fulfillment of authorial 
intentions by the selection of authorially sanctioned substantive 
variants from different states of texts, and whose copy-text was 
selected on the basis of its accidentals being as close as possible 
to authorial usage." The emphasis on the idealized and preex- 
isting authorially sanctioned texts, the “Work,” to use Tanselle's 
term, was premised on the belief that “[t]hose texts, being reports 
of works, must always be suspect; and, no matter how many of 
them we have, we never have enough information to know with 
certainty what the works consist of.” This approach situates tex- 
tual studies as separate, “anterior to literary criticism,” and “the 
scholars first job" according to the first edition of An Introduction 
to Bibliographical and Textual Studies.” G. Thomas Tanselle took up 
the mantle of defending this version of textual studies, arguing 
that 
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the textual way of thinking—adjudicates between the com- 
peting claims of a basic dilemma: the feeling, on the one 
hand, that all artifacts, by their survival, deserve our respect, 
either because they put us in touch with what has gone be- 
fore or because we feel a social obligation to pass along in- 
tact what we have received; and, on the other, the realization 
that they may fail to represent, for a variety of reasons, what 
their producers intended or what we feel we need, and that 
without correction or repair they may be misleading guides 
to the past, and without innovative change they may seem 
unsatisfying. 


To Tanselle, “Such editing sought to establish a fixed, definitive 
text, usually theorized as an ur-text marred by subsequent cor- 
ruption in transmission.” !! The concern regarding the displace- 
ment of work within the editing process would figure predomi- 
nantly in concerns about digital reproductions of literature, most 
often a fear of loss of editorial control that will be discussed later 
in this chapter. 

Those scholars invested in Greg-Bowers editorial prac- 
tices would also feel threatened by the displacement of editing 
within the American academy. While textual studies work was 
considered a central aspect of literary studies during the early 
to mid-century, by the 1990s deconstruction and high literary 
criticism had driven textual studies to the borders of the field. 
Post-structuralists rejected the materiality of the text that those 
invested in editorial work relished, broadening the concept of 
text to a definition far more amorphous than that embraced by 
those in the Greg-Bowers camp. Theorists such as Derrida re- 
fused the physical constraints attached to text, arguing for “a 
‘text’ that is henceforth no longer a finished corpus of writing, 
some content enclosed in a book or its margins, but a differential 
network, a fabric of traces referring endlessly to something other 
than itself, to other differential traces.” In response to Harold 
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Bloom’s similarly stated comment that “there are no texts . . . but 
only interpretations,” Thomas Tanselle responded, “he is obvi- 
ously equating ‘texts’ with ‘works’ and asserting that works have 
no meanings independent of the interpretations of those who en- 
counter them."? Such statements as those made by Bloom and 
Derrida attacked core values of those who embraced the Greg- 
Bowers editorial approach, with Tanselle noting: 


In recent years there has been an increasing tendency for 
literary critics to refer to literary works as “texts.” In con- 
sequence, the term “textual criticism” has become ambigu- 
ous, some people regarding it as a synonym for “literary 
criticism.” Traditionally, of course, “textual criticism” has 
meant the scholarly activity of studying the textual histo- 
ries of verbal works in an effort to propose reliable texts 
of those works (according to one or another definition of 
correctness).!* 


Articles published in textual studies journals, “Textual and Liter- 
ary Theory: Redrawing the Matrix” by D.C. Greetham, “Textual 
Criticism and Deconstruction” by G. Thomas Tanselle, and “Text 
as Matter, Concept, and Action” by Peter L. Shillingsburg, high- 
light the conflict. In Greetham’s words, textual scholars took “on 
post-structuralists in a direct struggle for the body of the text."? 
'The tension between criticism and textual studies has not dis- 
solved and the legacy of that tension had a lasting impact on the 
development of digital humanities within the American academy. 

'The centrality of the digital edition form has intimately con- 
nected digital literary studies to traditional textual studies ap- 
proaches in the minds of many critics, in turn replicating splits 
between textual studies and literary criticism. The rejection of 
textual studies by literary criticism has been discussed in great 
detail within textual studies, but there has been little consider- 
ation of the duplication of such splits within digital humanities 
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because of the roots of textual studies. In part, the rejection of digi- 
tal literary studies has occurred because of the legacy of associat- 
ing edition building with mechanical, applied work, leading to 
the charge of uncomplicated, simplistic, and mechanistic digital 
literary studies work. Michael Groden sums up the original tex- 
tual studies/literary criticism divide: “Literary theorists and crit- 
ics have tended to see editing and bibliography as activities that 
are preliminary to criticism and the textual theorists and critics 
themselves as concerned only with empirical evidence, often with 
minute details (commas, watermarks)."!6 In his notorious “The 
Fruits of the MLA," Edmund Wilson argues that textual editors 
have monopolized and suppressed the pleasure of literature and 
dampened the impact of literature across the wider culture.'’ The 
charge of overt technicality and devotion to minutia at the exclu- 
sion of literary pleasure is similar to critiques of digital editing. 
In Ian Small’s understanding of a digital editor, “he or she must 
cease to edit, in the sense of exercising any form of control or 
judgment. The postmodernist hypertext editor apparently needs 
only to supply data; he or she need not order it." Small continues 
by representing the editor as powerless: “In the process, though, 
that editor appears also to have been stripped of any effective 
agency, authority, or responsibility . . . The logic of such a move 
would be to de-skill and demote the very individuals, text-editors 
and text-theorists, whose interests it is supposed to promote.”'® 
The charges against digital editing are long standing, nearly en- 
grained in contemporary critical approaches, which views edit- 
ing, whether print or digital, as a return to conservative critical 
approaches to literature. Leroy F. Searle points out that “[flor 
an earlier generation, the vocation of editorial scholarship often 
seemed a haven (if not the very citadel) of intellectual probity, 
in which one could practice a science—mild and respectful, if 
sometimes dull—without being drawn into the relatively unreg- 
ulated life of literary criticism and theory, where, as I.A. Richards 
remarked after a lifetime of experience with it, ‘an indecent dis- 
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regard of fact is still current form.”!? The argument voiced by 
Searle is part of a contiguous arc, where the history of textual 
studies work has strongly influenced the way in which digital hu- 
manities has become understood. 

Further, the displacement of editing and the impact on digital 
humanities has happened within a particularized national con- 
text that has adversely impacted American work in digital editing. 
Scholars including Hans Gabler, G. Thomas Tanselle, and Jerome 
McGann” are quick to point out that the treatment and trajec- 
tory of the field differs greatly by nationality. Robert Hume, in 
“The Aims and Uses of “Textual Studies,” notes that textual stud- 
ies has reached a *. . . low standing . . .in North American English 
departments .. . Few major institutions emphasize editing or bib- 
liographic scholarship, and bright students are rarely encouraged 
to take up these lines of work. The bibliography/literary criticism 
dichotomy has become a chasm over the last twenty or thirty 
years, with critics increasingly neglectful and even contemptuous 
of bibliographic scholarship." The lack of standing in North 
American departments, where textual studies work is often con- 
sidered “just editing" by those invested in literary criticism, is not 
so in the European academy where textual studies continues to 
hold an important position within academia. Perhaps this is be- 
cause, as Hume notes, Anglo-American editing is distinctly dif- 
ferent than European editing, where “literary editing has rarely 
been carried out with much respect for the Greg-Bowers pro- 
gram."? Hans Gabler likewise points to the “dichotomy between 
criticism and scholarship,” which he argues is “an American divi- 
sion . . . in the first place; responsible, I believe for much in the 
present modern topography of the academic landscape in Eng- 
lish, American, and modern languages and literatures; and never 
whole-heartedly embraced as a mode of self-definition in literary 
studies in Europe."? The digital turn has done little to bring Eu- 
ropean and American attitudes toward textual studies work to- 
gether. Europeans have produced a far greater number of digital 
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editions and the form continues to have a great deal of currency. 
Though digital editions are currently produced in the United 
States, the position of editing within the American academy has 
meant that those working on such projects continue to find their 
work stigmatized. 

However, the impact of editorial displacement from the 
mainstream American academy is not equally distributed. The 
dominance of the Greg-Bowers approach was contested by those 
who championed a reevaluation of the theoretical framework of 
textual studies, such as Jerome McGann, D. F. McKenzie, and 
David Greetham. McGann took on the Greg-Bowers school’s 
approach to editing in the early 1980s after his experience with 
editing Byron. In his 1983 Critique of Modern Textual Criticism, 
McGann challenged those that privileged the position of author 
in the text, arguing that such ideas *so emphasize the autonomy 
of the isolated author as to distort our theoretical grasp of the 
‘mode of existence of a literary work of art’ (a mode of existence 
which is fundamentally social rather than personal)."* Designed 
to represent a production process rather than individual moment 
of creation, the social text criticism proposed by McGann is de- 
scribed by Greetham as “an alternative view of composition, in 
which the entire history of the work is a fit subject for textual 
scholarship, and even posthumous changes by editors, publish- 
ers, friends and relations, are to be considered a perfectly valid 
part of the text read as a social construct."^ McGann’s challenge 
to the Greg-Bowers approach generated not only a great deal of 
tension within the field but would lay the groundwork for the 
move from digital edition to digital archive that will be discussed 
in the next chapter. 

During the height of digital editing, editors invested in pro- 
ducing high quality scholarly editions were increasingly con- 
cerned about the future of edition production. Regardless of 
how the scholar viewed his or her school of editing, limitations 
of print technologies and the economics of scholarly publication 
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increasingly constrained textual and bibliographical scholarship. 
Peter Shillingsburg provides a useful analysis of issues of length, 
completeness, and the economics of scholarly publication in a 
1996 article: 


A scholarly edition is a thick book (five hundred to a thou- 
sand pages) printed on acid-free paper guaranteed for 350 
years, in sturdy bindings, with a list of ten to twenty editors 
and advisory editors, published by a reputable academic 
press and costing a minimum of fifty dollars, but more of- 
ten over one hundred. It contains a Pure Virgin Text or, 
unironically, a Fully Restored one. Already a thick tome be- 
cause of the historical and textual introductions and textual 
apparatus, scholarly editions frequently exclude explana- 
tory annotations because the space they require would add 
unduly to the cost (already out of the reach of ordinary 
mortals and nearly out of reach for the ordinary research 
library).”° 


Print, as Shillingsburg and many other critics have pointed out, 
is very stable. It is a known form, has a workflow for production, 
and survives over a fairly long time period. While acknowledg- 
ing that print is superior in many ways, textual studies scholars 
recognized that the form was plagued with worrying restrictions. 
Scholarly editions are expensive to publish and page numbers are 
limited." The annotations and apparatus are often restricted or 
excluded, driven by economic and print size concerns that lead 
to the production of an unsatisfactory scholarly product. Even 
more alarming since the publication of Shillingsburg’s article, the 
scholarly publishing field has contracted almost to the point of 
nonexistence. The MLA Ad Hoc Committee on Scholarly Pub- 
lishing presented sobering findings in their 2002 report,”* noting 
that declining subsidies of university presses and decreases for li- 
brary acquisitions has caused a substantial drop in sales while the 


20 * TRACES OF THE OLD, USES OF THE NEW 


numbers of faculty who are required to produce a monograph 
publication for tenure and promotion has increased, putting un- 
due pressure on the already fragile system. Scholarly editions, in 
particular, stated the committee, are under threat because of cost, 
time of production, and NEH funding cuts.” Such constrictions 
drove those interested in producing editions to consider experi- 
mental forms of the digital edition. 

Imagining how the digital environment would best serve tex- 
tual studies consumed scholars in the 1990s. Even the most tra- 
ditional textual studies scholars recognized that they needed to 
confront the digital, whether to embrace, to alter, or to reject the 
new technology. Richard Finneran would call the digital a “fun- 
damental paradigm shift,” and David Greetham, who imagined 
“the fully open, scriptable, postmodernist edition of literature,” ar- 
gued that we “will need the facilities of electronic, reader-driven 
editions to achieve the flexibility and lack of closure that differ- 
ance observes.”*! Still other scholars, such as Peter Shillingsburg 
and Jerome McGann, developed digital tools and editions and 
argued for the centrality of digital work to textual studies. Even 
G. Thomas Tanselle recognized that the digital was useful to edi- 
torial work, though he posited a far more constrained view of 
the power of technology: “Computerization is simply the latest 
chapter in the long story of facilitating the reproduction and al- 
teration of texts; what remains constant is the inseparability of 
recorded language from the technology that produced it and 
makes it accessible.”*? While these scholars didn't agree about 
how the digital should be applied to textual studies work, they all 
acknowledged that the digital would impact scholarship. 

During this period, groundbreaking scholarship would pro- 
duce a textual studies core within digital literary studies. From 
Peter Shillingsburg’s early Scholarly Editing in the Computer Age 
to Jerome McGann’s later Radiant Textuality, many textual stud- 
ies scholars professed their investment in technology and the 
text. Collections, including The Literary Text in tbe Digital Age 
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and Electronic Textual Editing, helped to define digital humanities 
with a particularized textual studies slant.” The very same ten- 
sions that caused scholarly battles among editors of print editions 
would emerge in the creation of digital editions, a legacy that re- 
mains with us today. As scholars began to imagine how the digital 
might help them build a better edition, they also were forced to 
engage, once again, in the very debates that split textual studies 
practice. What, exactly, is a text? What is the role of the edi- 
tor? How does one understand an individual item in relation to 
many items? For a portion of American textual studies scholars, 
primarily those committed to the Greg-Bowers school, digital 
editions emphasized a continued integrity of the object of study. 
Experimental digital editions including The Electronic Beowulf, 
The Canterbury Tales Project, and The Piers Plowman Project* ad- 
opted tool-based technology approaches to enact best practices 
in the new environment and launched digital editing that focused 
on what the MLA Committee on Scholarly Editions described as 
“the scholarly edition’s basic task"—"to present a reliable text." 
Concerns about reliability in the new digital environment were 
central to presentations of texts, with Peter Shillingsburg sound- 
ing the alarm over poor quality texts found in online sources 
like Project Gutenberg and about an overreliance on comput- 
ers, which Shillingsburg describes as “just a convenience."?^ This 
early treatment of tools as a means to the end, tools as applica- 
tions to manipulate the text into a representative form, would 
remain one important legacy of textual studies. 

Perhaps the greatest impact of the textual studies tradition is 
the emphasis on a whole text approach, a digital edition that is 
not deformed and a trustworthy text that has been subjected to 
editorial control. Peter Shillingsburg has very vocally argued for 
a holistic approach to text, “a recognition of the textual condition 
understood whole," and perhaps his words have become more 
strident in recent years as datamining and visualization propo- 
nents have pushed for a destabilized or even fragmented text." 
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In Scholarly Editing in the Computer Age, Shillingsburg restates 
“editing is, above all else, a matter of forms.” The “forms, the 
details of presentation, are often thought to be the responsibility 
of editors.”** Accordingly, tools, whether the computer, interface, 
database, or collation program, were designed to produce edi- 
tions that emphasized continuity of form and a clear recognition 
of materiality. The centrality of form that Shillingsburg points to 
has transferred into digital literary studies and is apparent in the 
way in which interface represents the physical object. For many 
early edition projects, the digital interface mimics the traditional 
book structure and includes core elements of the book such as 
the table of contents, page display, and index. For example, when 
Studies in Bibliography (SB) was brought online by the University 
of Virginia Electronic Text (Etext) Center in 1995 it maintained 
the centrality of the book through the interface (see fig. 1.1). 
Form, in such editions, was a correlation to materiality, not a 
means of manipulation. In his foreword to Electronic Textual Edit- 
ing, lanselle?? accepts that the digital is a useful medium in which 
to publish the edition, but warns that “when the excitement leads 
to the idea that the computer alters the ontology of texts and 
makes possible new kinds of reading and analysis, it has gone 
too far.”* The binary that Tanselle sets up— digital as a tool ver- 
sus the digital as a means to new forms, ontologies, reading, or 
analysis—is a theme replicated across digital textual scholarship 
and projects. Or, as Speed Hill has written, “I can live with tech- 
nological change per se, but I fear the more fundamental shift 
in the aims and purposes of scholarly editing that threatens the 
work we invest in preserving the artifacts we cherish. Technolog- 
ical change overvalues the new, the computer-hip, the gee-whiz 
factor, while devaluing editions that appear in the form and for- 
mat of the traditional code."* David Gants also sees the digital 
environment as a way to represent “a well-designed electronic 
edition" that "can exploit the flexibility of the digital medium and 
avoid the need to deform the text; it can shift and adapt to the 
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needs of the individual user, encouraging us, as Tanselle notes, 
to become collaborators."? In each of these understandings of 
digital editing the end goal remains the same— "to avoid the need 
to deform the text." Even those who are some of the staunchest 
supporters of digital innovation recognize that early digital edi- 
tion work did little more than replicate the print structure. Pe- 
ter Robinson, for example, acknowledges the imitative qualities 
of early digital textual studies projects: “The first missing aspect 
is that up to now, almost without exception, no scholarly elec- 
tronic edition has presented material which could not have been 
presented in book form, nor indeed presented this material in a 
manner significantly different from that which could have been 
managed in print."? It is the "sameness" of digital that is the reas- 
suring thread that runs through early digital work, and the hall- 
mark of the early digital edition is the stability, the reassurance, 
of a form that replicates print. 

While the form remained stable, scholars hoped that tech- 
nology would allow editors to create digital versions that would 
be “better than” print editions. Rejecting what some saw as the 
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textual instability advocated by social text theorists like Jerome 
McGann, digital editors tried to create a digital replica, a work 
in much the sense of Tànselle. States Shillingsburg, “In the ‘work 
represented’ I argue that just as a researcher in a library request- 
ing the first edition of a work would reject a transcription of that 
edition as a basis for research, so a researcher using an electronic 
edition should also reject a transcript. An image is, after all, as 
close as one can get, electronically, to the original. The transcript 
becomes a convenience for searching, while the original (or a 
good image of it) continues to be used as the real thing.”* Hill 
echoes this critique in his response to McGann’s digital work, 
which he says embraces “a device designed for and dedicated to 
the de-materialization—for that’s what the term ‘digitization’ 
really means—of that very same material artifact into a signal 
stream made up of zillions of offs and ons, wholly dependent on a 
complex infrastructure over which none of us has any control."* 
The centrality of the material object would loom large in early 
digital edition development, as editors battled to assert control 
over the textual representation. 

The Electronic Beowulf (1994) is an exemplar of the work made 
stable within the digital environment. Rife with textual studies 
apparatus, Beowulf includes multiple manuscripts, transcriptions, 
definitions and other types of support materials, and illustrations 
from the one original manuscript that “far exceed the appearance 
of any published black-and-white facsimiles,” effectively becom- 
ing a value added facsimile edition, an idealized edition that is 
broader and more complete than any previous print edition. 
The “better than print” edition allows scholars to thumb through 
the fragile pages of the physical manuscripts, gathered together 
in their entirety rather than spread throughout numerous librar- 
ies around the world, and to use technology to collate the various 
transcriptions. Further, the digital edition contains *hundreds of 
fiber-optic readings of hidden letters and ultraviolet readings of 
erased text from the early 11th-century manuscript" and “read- 
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Fig. 1.2. A manuscript image and transcription from the Electronic Beowulf 
CD-ROM. (From William Kilbride, reviewer. “Whose Beowulf is it 
anyway? Review of Electronic Beowulf [CD-Rom].” Internet Archaeology 9, 
2000. http://dx.doi.org/10.11141/ia.9.12.) 


ings that were visible to J. J. Conybeare in 1817, before the rọth- 
century binding frames covered the burnt edges of the manu- 
script; and selections from other documents needed to restore 
the damaged text"" (see fig. 1.2). 

'The ability to utilize computer techniques to find hidden let- 
ters, erasures, and to reinstate lost sections of the text encapsulates 
the age of the digital edition where technology was used *to draw 
attention back to the manuscript." Further, the Beowulf edition 
enforces the idea of the work, where the restored text is as close 
to the idealized original as possible. Technology, in the age of the 
digital edition, is useful because it allows the editor to publish the 
material text as it was believed to be originally constructed. 

Joe Viscomi, an editor of The William Blake Archive, likewise 
sees the digital environment as a way to resolve problems found 
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in the production of a traditional print edition, in the case of 
Blake's work how to represent his multiple hand-colored poems. 
Editors interested in Blake found the digital edition appealing 
because Blake's work was severely limited by print production. In 
print editions, editors either developed a text only edition, which 
was inexpensive, a monochromatic reproduction, and which left 
out the details of the original, or hand-colored collotypes, often 
prohibitively expensive.” The Blake Archive developed protocols 
to replicate the uniqueness of the individual images in the digital 
environment, producing a stunning and groundbreaking archive, 
an edition that fully recognizes the individual material text, a 
high-end digital facsimile. The Blake Archive supplies “reproduc- 
tions that are more accurate in color, detail, and scale than the 
finest commercially published photomechanical reproductions 
and texts that are more faithful to Blake’s own than any collected 
edition has provided"? (see fig. 1.3). 

'The project was awarded the 2003 MLA Prize for a Distin- 
guished Scholarly Edition, an honor rightly bestowed on the 
edition for its use of the digital to make a better edition, one 
of the most fully realized representations of the textual studies 
tradition brought digital. Yet I want to emphasize that the Blake 
Archive’s success depends in large part on its recognizable form 
that bridges the gap between traditional textual studies work and 
digital humanities. The Blake Archives insistence on the central- 
ity of image to the digital edition speaks to the continued dis- 
trust of technological interpretations of the original work. The 
editors note, “our images are not intended to be ‘archival’ in the 
sense sometimes intended—virtual copies that might stand in 
for originals after a fire,” even while they make every attempt to 
reproduce the physical object, from scale to resolution to color 
suggesting that the editors continue to view the material object 
as unique and, ultimately, unreproducable.*! The tension be- 
tween the physical object and how it is represented, the ultimate 
distrust of the technological representation would be a hallmark 
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of the digital edition. Philip Cohen is one of many editors who 
were concerned about the increasing ability of anyone to edit and 
produce a text, of digital programs that “allow readers to create 
different combinations of the extant texts.” To Cohen, and other 
editors, electronic approaches will not mean “the end of textual 
scholarship because assembling electronic editions and account- 
ing for textual variations will still require print.”*’ Editors hoped 
that the digital environment would allow for improved editing, 
but they resisted what they feared would lead to a deformation of 
the text and the possibility of unschooled editorial intervention. 
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Another way to fix the text and to assure quality within the 
digital environment was to partner with scholarly presses. Dur- 
ing this period the University of Michigan produced Piers Plow- 
man; the University of Kentucky, in partnership with the British 
Library, produced the Electronic Beowulf, the University of Vir- 
ginia produced the presidential papers, including the Adams and 
Thomas Jefferson Papers, and a variety of nineteenth-century 
literary editions, such as Clotel by William Wells Brown and Her- 
man Melville’s Typee; and Cambridge University Press published 
portions of the Canterbury Tales Project. New organizations were 
formed to ensure the quality of digital publications. SEENET, the 
Society for Early English & Norse Electronic Texts, publisher of 
Caedmon's Hymn, Piers Plowman, and other digital editions, “was 
established in the 1990s to gain the benefits of new electronic 
technology without sacrificing what scholars have learned about 
textual criticism and its sister disciplines"? SEENET’s digital 
editions were modeled on a “long-established book publication 
series, Medieval Academy Books (MAB), which is overseen by the 
Publications Advisory Board and published in collaboration with 
the University of Ioronto Press. Like MAB, the SEENET series 
focuses on editions and scholarly tools of importance to medieval 
studies. And like MAB, the SEENET series is made possible by 
working in partnership with an established press.” SEENET 
utilized a production to distribution process that closely followed 
print production, a model common among scholarly publishers 
interested in going digital. The press would accept a scholar's 
work and then vet that work through their network of experts, as- 
suring the quality of the edition. Once the project was accepted, 
the press would develop an infrastructure through which to pub- 
lish the project. The user would purchase either a subscription 
to a web-based edition or a CD-ROM. The workflow was re- 
markably similar to that of a print edition, which maintained the 
various checks on quality and control over output, a means of 
assuaging fears of digital instability.” Concerned about textual 
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production in an age where editorial control was eroding, editors 
reasserted their centrality by linking their work to the known 
marker of authenticity, the scholarly publishing house. 

The imprimatur of the scholarly publishing house helped to 
validate the new digital form, but scholars remained committed 
to a physical version of their edition, the CD-ROM. The Com- 
pact Disc Read-Only Memory was a data disc that could be im- 
printed and locked, immune to manipulation by the user. Like a 
book, a CD could be published, distributed, and owned. It could 
be placed on a shelf, taken down and perused, then returned to 
the shelf to be enjoyed at a later date. Peter Robinson, who ini- 
tially published his ambitious Canterbury Tales digital editions 
with a scholarly press, Cambridge, and later his own publishing 
company, Scholarly Digital Editions, argues that the publica- 
tion methods in the early to mid-199os made the CD-ROM the 
preferred form of publication as scholars believed it to be a far 
more stable technology than the web.?^^'T'he CD-ROM allowed 
the publisher to replicate the print infrastructure for publishing, 
acquiring and lending, making it, according to Hockey, the “me- 
dium of choice for many electronic publications, largely because 
it is easier for publishers and librarians to handle. It fits in better 
with procedures for handling books and in some ways can be 
seen as an extension of them.”*’ The CD, with its ability to be 
transported, to be bought and sold, with its ability to mimic the 
book's organizational structure was a reassuring form to those 
concerned about the decline of editorial control. 

Other advances in digital literary scholarship, such as the TEI/ 
XML, are in line with concerns about the stabilized, editorially 
controlled whole text. The TEI was established in 1987 by The 
Working Committee on Text Encoding Practices of the Associa- 
tion for Computers and Humanities and, over its almost forty 
years of development, has risen to become the de facto encod- 
ing approach for digital texts. The members of the initial TEI 
Working Committee” requested an NEH Emergency Grant 
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to “develop and promote guidelines for standard text-encoding 
practices in preparing machine-readable texts for scholarly re- 
search." The committee decided to modify SGML with “a set 
of markup tags, and to define how they can be used."^ By 1990 
the TEI Working Committee would publish its first guidelines, 
and in January of 1999, the TEI consortium was formed. The 
'TEI is a useful tool and mastery of T'EI markup is often con- 
sidered to be a base skill for digital literary studies, but practi- 
tioners admit that it is problematic for certain types of editorial 
purposes. Numerous scholars have pointed to the limitations of 
the TEI for complex textual representations, but an examina- 
tion of the history and evolution of TEI suggests that the prob- 
lem lies, to some degree, in the TEI’s initial conception, as the 
TEI was not originally constructed for use by literary editors. 
The 1987 TEI NEH grant shows participation from linguists 
and literary scholars, but of those involved in the 1987 Working 
Committee not one of the originators of the TEI was a literary 
textual scholar. Lou Burnard and C. M. Sperberg-McQueen had 
backgrounds in English and comparative literature but worked 
within computing. David Barnard and Nancy Ide were in com- 
puter science departments. David Chesnutt was a historian and 
documentary editor working with historical papers. The special- 
ized needs of edition building literary scholars, then, were not 
central to the goals of the original TEI working group. Further, 
at the historical moment when TEI was proposed, historical and 
literary editorial methodologies were divergent. Clashes between 
historians and literary editors were driven public by G. Thomas 
Tanselle, who delivered a blistering talk at the American Docu- 
mentary Editing conference in the late 1980s condemning his- 
torical editing. Tanselle complained in his related 1986 article 
“Historicism and Critical Editing”: 


Even though many historical editors have practiced criti- 
cal editing in the sense that they have normalized or regu- 
larized certain features of their texts, and have not simply 
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produced diplomatic transcriptions, many of them have not 
been able to see the value of the further step that literary 
editors have often taken when dealing with multiple texts 
of a single work, the step of emending one text with vari- 
ants from another. Not having progressed beyond this ele- 
mentary stage in the process of thinking about editing, they 
have not been in a position to enter into the more sophis- 
ticated discussions of historicism in critical editing. It is an 
unfortunate fact that what historians have published on the 
subject of editing has not contributed to the development 
of editorial theory.” 


The initial NEH grant notes that TEI will focus on the needs “of 
text-oriented historical research (including documentary edit- 
ing) which are similar in nature"? and states that the consortium 
would invite participation from the American Documentary Ed- 
iting group, a scholarly group that draws membership from his- 
torical and literary editors. While this statement seems inclusive, 
the grant qualifies its inclusion by stating that they would invite 
the American Historical Association and historical editors, not 
literary editors or members of the Modern Language Associa- 
tion. This strange elision suggests that at least some members 
of the TEI consortium were well aware of the tensions between 
historical and literary editing and, with this notation the TEI 
was initially attuned to historical editing at the exclusion of liter- 
ary editing.^ The exclusion of literary textual editing from the 
formative TEI grant implies that the type of markup initially 
imagined would be more useful to historical editing than liter- 
ary editing. Therefore, it is not surprising that the TEI has been 
criticized by literary scholars interested in using the markup lan- 
guage. McGann states, 


TEIs emergence exposed the deep flaw in the TEI rep- 
resentation of texts. A text is not an Ordered Hierarchy 
of Content Objects; it is a manifold of an indeterminate 
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number of possible ordered hierarchies. Every text, every 
element of every text, is n-dimensional, depending on what 
you choose to regard as contextually relevant. This basic 
truth about representational media of all kinds, not just 
"texts", did not become so graphically apparent to scholars 
until the TEI consortium set out to implement its alter- 
native conceptual design. The failure of the OHCO thesis 
proved to be the TEI’s greatest contribution to textual and 
media studies, and to the further development of the TEI 
itself. 


TEI works well for marking the structure of the text, but repre- 
sentation of complicated editorial markings and variants that a 
scholar editing a literary manuscript might be interested in rep- 
resenting presents a much more difficult challenge. While those 
involved with the Poughkeepsie Principles, the initial stated goals 
of the TEI, were not outwardly hostile toward literary editing, the 
apparent differences in methodology cannot be overstated. The 
1999 formation of the TEI consortium signaled the beginning of 
the TEI’s desire to meet the needs of literary editorial scholars. 
With John Unsworth’s leadership, the MLA was named a partner 
in the consortium, signaling a ground shift in the TEI’s approach 
to literature. The consortium agreement emphasized that “[t]he 
key constituencies for the TEI, then, are the various scholarly 
communities involved in text encoding (linguists, historians, tex- 
tual theorists, literary historians, etc.), and libraries . . "^ Though 
the TEI shifted course, the legacy of their initial methodology 
continues to produce certain well documented limitations for 
those interested in applying TEI markup to literary texts. For 
Peter Shillingsburg and other editors, a whole text solution to 
digital editing would be solved through alternative technological 
solutions. Concerned about “file security and avoiding the con- 
flicts and corruptions endemic to Wiki solutions,” editing of texts 
by the masses, Shillingsburg has repeatedly called for a stand- 
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off markup solution, where a text remains untouched and ma- 
nipulation occurs outside of the pristine whole.” Shillingsburg, 
like most digital editors of this period, is concerned that highly 
trained editors and their skills are devalued in the new environ- 
ment and, in response, he calls for “a tagging tool that will allow 
textual scholars (not highly trained technical assistants) to associ- 
ate their analytical tagging as stand-off mark-up to already exist- 
ing texts." Such an approach will *allow others to add tagging 
without affecting the files already on offer," protecting the text 
from errors.^ Standoff markup is a solution posed to protect the 
editorially controlled text, a direct response to the potential for 
manipulations of texts marked with TEI. 

'The digital edition has had a great impact on digital literary 
studies and remains a central, visible form of digital literary pro- 
duction, so successful that Kenneth Price has stated that “digital 
work has achieved primacy only for editions.”® Current projects, 
such as the Melville Electronic Library, continue to explore how 
best to position the editor in relationship to technology.” For 
Melville, and project director John Bryant, editing is a far more 
interactive process than imagined by early digital edition cre- 
ators, with technology a helpmate for collaboration. Following 
Bryant's theory of fluid textuality, where texts are always evolv- 
ing due to interventions from a myriad of participants, Melville 
rejects a stabilization of one version of a text and makes visible 
multiple versions and changes, recording each instance while al- 
lowing users to make and record their own interventions. Clearly 
one legacy of the digital edition work is a continued evaluation 
of the position of the editor and the text. The primacy of the 
digital in edition production has also shifted the future of digi- 
tal editions. Edition building is now most likely to occur in the 
digital environment, driven by limitations of press support, ex- 
pansive opportunities for image-based editions, and technologi- 
cal manipulation of materials. Even long-standing print editing 
projects, from the Founding Father's documents to Shakespeare, 
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have turned to digital publication. On the other hand, digital edi- 
tion practices were and are interwoven into digital literary stud- 
ies, impacting the way that we understand current digital literary 
studies production. 

As this chapter reveals, questions regarding scholarly in- 
tervention, the material object and digital surrogates, and the 
treatment of computer deformation of texts are all legacies of 
the digital edition form. Unfortunately the history and impact 
of digital editions is often overlooked or misunderstood by 
many digital humanities practitioners. Editions are recognized 
early digital forms, but textual studies approaches, from TEI to 
interface design, have become naturalized in the field without 
a clear understanding of their historical context. Further, some 
in the digital humanities community have begun to view digi- 
tal edition building as technologically unsophisticated, in effect 
echoing Ian Small’s view of editing as “a largely pragmatic, un- 
sophisticated activity.””! Financial support for digital editing is 
difficult to obtain and recognition of scholarly skill required for 
edition building remains low. On the other end of the spectrum, 
however, is the overreliance on the edition form, long a prob- 
lem in textual studies. Jerome McGann warned of the dangers 
of such an approach in his predigital 1985 essay “The Monks 
and the Giants: Textual and Bibliographical Studies and the In- 
terpretation of Literary Works”: 


If textual and bibliographical studies are to have a significant 
impact on literary interpretation, textual criticism will have 
to be reconceptualized along lines that transcend an edi- 
torial theory. Of course, an editorial perspective on the 
principles of textual criticism is imperative under certain 
circumstances. Nevertheless, such a perspective only tends 
to obscure matters when the central issue is the relation of 
textual scholarship to literary meaning.” 
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McGann’s warning continues to be appropriate. An overreli- 
ance on a particularized approach or methodology that excludes 
expansion or innovation is as dangerous as the lack of historical 
understanding of textual studies approaches. Perhaps Margaret 
Ezell has articulated this most concisely. In discussing the move 
to “e-editions,” she notes, “it becomes clear to me that while we 
increasingly have the ability to digitalize any text we please— 
although there are certainly grounds for debate how well the 
digitalized images capture the features of the material original— 
editors do not please to select certain types of material and this 
is in part because perhaps we are not yet changing some of the 
basic assumptions about what an ‘edition’ does, or in Hunter’s 
terms, what is fappropriate.""? We need to be mindful of, but not 
bound to, textual studies approaches revealing how the digital 
edition has transferred certain representations into naturalized 
digital literary practice. 

Textual studies has a problematic relationship to diversity that 
has unfortunately transferred to current digital literary scholar- 
ship. Martha Nell Smith has pointed to the rigid and exclusive 
normative practices in textual studies and calls for editing to 
“take into account the ‘messy’ facts of authorship, production, 
and reception: race, class, gender, and sexuality.”’* Of her first 
attendance at the Society for Textual Studies (STS) conference, 
the major international society meeting for academic editors, she 
noticed that *[m]ore than a few participants in STS seemed to 
think of it as a space free from all the messiness of questions of 
identity and politics."?^ While editors may have seen themselves 
as forced out of the mainstream of academic literary studies, 
their resistance to issues of gender, race, class, and sexuality in 
large part made them outsiders to normative practice within the 
broader field. Julia Flanders has examined the early treatment of 
gender in editing and argues that “the error-ridden manuscript 
is figured in the rhetoric of the humanist scholar, as an unchaste 
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female body which has suffered 'corruption' . . . as a result of 
sexual attack.""6 The editor, then, must restore “a lost wholeness” 
by “intervening in the text and altering it yet more so as to cover 
over the places of its corruption.””” The figuration of the cor- 
rupt text is apparent in the Greg-Bowers approach to editing as 
well. Fredson Bowers, to many the grandfather of modern textual 
studies, was obsessively concerned about issues of textual purity. 
In Textual Criticism and the Literary Critics he writes, “the most 
important concern of the textual bibliographer is to guard the 
purity of the important basic documents of our literature and 
culture . . . One can no more permit ‘just a little corruption’ to 
pass unheeded in the transmission of our literary heritage than 
‘just a little sin’ was possible in Eden.””* Bowers’ correlation be- 
tween textual purity and Eve’s original sin, the “little sin” that 
caused the fall from Eden, reveals just how deep-seated prob- 
lematic representations of gender are to the formation of ideas 
regarding editing. In fact, Bowers’ interest in purity has an odd 
correlation to his hobby, dog breeding. In his tribute to Bow- 
ers, “The Life and Work of Fredson Bowers,” Tanselle points to 
Bower' interest in breed standards of the Irish Wolfhounds that 
he bred, showed, and eventually judged: 


That Bowers should have involved himself actively in this 
matter [breed standards] is not surprising, for his mind was 
attracted to categorization and systematization, and the 
problem was not unlike the bibliographical question he 
later addressed concerning the description of *ideal" copies 
of books, abstracted from the idiosyncrasies of actual sur- 
viving copies. In one of his most thoughtful columns on the 
wolfhound standard (January 1939), he wrote that *it is of 
the utmost importance that a clear and reasonably definite 
set of rules be laid down for judging any breed if the great 
benefit which dog shows confer upon improving the breed 
is not to turn into a boomerang by reason of such diverse 
judging that no practical ideal can be ascertained." 
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Bowers interest in “rules” related to purity underpins the notion 
that editing has indeed been limited in scope and approach and 
is often exclusionary of particular texts. If the editor's goal is to 
protect the pure text, according to Bowers, then it should come 
as no surprise that textual studies scholars resisted calls to inves- 
tigate issues of diversity. The messiness of such outside forces 
threatened to disrupt foundational ideas about editing. This his- 
tory is part of digital editing, which likewise struggled with how 
to protect the text. 

As this chapter suggests, the digital edition was the most 
prevalent form of the first generation of digital literary work 
in the United States. While textual studies work has not neatly 
transferred into digital literary studies nor has textual studies re- 
mained the dominant mode, the form is still useful and necessary, 
and there is no reason to believe that we won't continue to see 
digital editions produced under the auspices of digital humani- 
ties work. Digital humanists must not lose sight of the textual 
studies methodological and theoretical approaches that underpin 
the field. To do so is not only to negate important early work 
but to deny crucial scholarly projects needed to advance literary 
studies. At the same time, digital literary scholars must recognize 
how such forms impact the types of scholarly questions that we 
might ask, excavating the impact of textual studies methodology 
and theory. 


CHAPTER 2 


The Era of the Archive 


The New Historicist Movement and 
Digital Literary Studies 


Out of the vast array of textual traces in a culture, the identi- 
fication of units suitable for analysis is problematized. If every 
trace of a culture is part of a massive text, how can one iden- 
tify the boundaries of these units? What is the appropriate 
scale? There are, we conclude, no abstract, purely theoreti- 
cal answers to these questions. To a considerable extent the 
units are given by the archive itself-that is, we almost always 
receive works whose boundaries have already been defined 
by the technology and generic assumptions of the original 
makers and readers. But new historicism undertakes to call 
these assumptions into question and treat them as part of the 
history that needs to be interpreted. 


— Catherine Gallagher and Stephen Greenblatt, 
Practicing New Historicism, 14-15 


I begin with one of *my own traveler's anecdotes" from a journey 
to digital humanities.! In 1996, I attended the American Litera- 
ture Association (ALA) meeting held in San Diego, California. I 
was particularly interested in the *New Vistas in Whitman Stud- 
ies" panel during which my former professor, Kenneth Price, 
was to introduce his new scholarly project, something he called 
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The Walt Whitman Archive (WWA). The large group of attendees 
waited impatiently while the conference organizers struggled to 
set up a computer projector, probably the very first such technol- 
ogy used at ALA. As the talk began, murmurs were heard cri- 
tiquing the technological issues that slowed the timely start of 
the presentation. Yet, as Price began to show page after page of 
Whitman’s manuscripts, people in the room started to nod. Such 
was the era of the digital archive, where scholars outside of spe- 
cialized fields of literary studies began to come into contact with 
digital humanities forms. 

Digital edition production peaked in the early 1990s, and by 
the mid-1990s the digital archive began to emerge as the domi- 
nant form in American digital literary studies. The shift from edi- 
tion to archive is not uniform, with digital edition production 
continuing into the contemporary period.’ Instead, the digital ar- 
chive became the dominant, but not exclusive, form in American 
digital literary studies. The shift from digital edition to digital 
archive is related, in large part, to the rise of new historicism. 
While only a handful of scholars working within the new histori- 
cist framework embraced digital scholarship, their incorporation 
of new historicist theories into technological methodology has 
defined the field. Jerome McGann, The Rossetti Archive (RA); Ken- 
neth Price, The Whitman Archive (WWA); Martha Nell Smith, the 
Dickinson Electronic Archives (DEA); Cathy Davidson, HASTAC; 
Alan Liu, Romantic Chronology; and even Stephen Greenblatt, 
with his undergraduate course “A Silk Road Course: Travel and 
Transformation on the High Seas: An Imaginary Journey in the 
Early 17th Century,” have experimented with digital technolo- 
gies.’ Students of these scholars—including Andrew Jewell, The 
Willa Cather Archive; Amanda Gailey, Race and Children’s Litera- 
ture of the Gilded Age; and Craig A. Warren, the Ambrose Bierce 
Project—have continued the trend. Jerome McGanns creation of 
NINES, the Networked Infrastructure for Nineteenth-Century 
Electronic Scholarship, coalesced the individual efforts of schol- 
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ars and positioned the archive in the center of the digital literary 
field. As the digital archive moved to the center of digital literary 
studies, so too did new historicist thinking. Multiple versions of 
one text are not the centerpiece of the digital archive as they are 
in the digital edition; the text is understood to be in conversation 
with an ever-widening gyre of materials that include literary, cul- 
tural, and historical texts. This new historicist conception of the 
archive imagines the text within an expansive system, with the 
textual materials positioned in a network of conversation with a 
wide range of cultural materials. Scholars working within the ru- 
bric of new historicism positioned the physical archive and print 
materials as the centerpieces of their work. The digital environ- 
ment would serve as a mechanism for refining the archive. 

In 1982, Stephen Greenblatt used the term new historicism 
in his introduction to the Genre special issue, The Power of Forms 
in tbe Englisb Renaissance. While there are other, earlier uses of 
the term, Greenblatt’s articulation of new historicism launched 
the emergence, and some might say dominance, of the critical 
practice in the American academy. New historicism perceives lit- 
erature as located within a historical, cultural, and social matrix 
and that this matrix allows for a deep reading of the text in ques- 
tion. At the same time, the scholar is to maintain a self-reflexive 
critical stance in relation to the text. While there is agreement 
on aspects of critical representation of the text, new historicists 
resisted a monolithic definition of their approach. In “The His- 
toricist Enterprise,” Jeffrey N. Cox and Larry J. Reynolds point 
to “sharp differences between the concerns and practices of vari- 
ous New Historicists.” Regardless of dissension, they believe that 
“the enterprise has discernible features." H. Aram Vesser pin- 
points the following as central tenets of new historicism: 


I) that every expressive act is embedded in a network of 
material practices; 2) that every act of unmasking, critique, 
and opposition uses the tools it condemns and risks falling 


The Era of the Archive + 41 


prey to the practice it exposes; 3) that literary and non- 
literary ‘texts’ circulate inseparably; 4) that no discourse, 
imaginative or archival, gives access to unchanging truths 
or expresses unalterable human nature; and 5) that a critical 
method and a language adequate to describe culture under 
capitalism participate in the economy they describe.’ 


Or, in Louis A. Montrose’s definition: 


The writing and reading of texts, as well as the processes 
by which they are circulated and categorized, analyzed and 
taught, are being reconstructed as historically determined 
and determining modes of cultural work; apparently au- 
tonomous aesthetic and academic issues are being reunder- 
stood as inextricably though complexly linked to other dis- 
courses and practices—such linkages constituting the social 
networks within which individual subjectivities and collec- 
tive structures are mutually and continuously shaped.^ 


The shifting definitions remind us that new historicism is, as 
Greenblatt emphasizes, “a practice rather than a doctrine,” and 
while those working within the boundaries of new historicism 
never fashioned a shared definition of the term, these interpreta- 
tions suggest how we might imagine a coherent critical approach 
through which to analyze the digital archive. 

Greenblatt may have launched our contemporary understand- 
ing of new historicism, but Jerome McGann brought new his- 
toricism to the digital age. Donald Waters, Andrew W. Mellon 
Foundation program officer, identifies McGann’s 1983 A Critique 
of Modern Textual Criticism as the text that launched literary digi- 
tal humanities.* Waters's assessment hinges on McGanns5 theory 
of textuality: McGann contends that “the apparitions of text— 
its paratexts, bibliographical codes, and all visual features—are 
as important in the text’s signifying programs as the linguistic 
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elements,” and “that the social intercourse of texts—the context of 
their relations—must be conceived an essential part of the ‘text 
itself if one means to gain an adequate critical grasp of the tex- 
tual situation."? In fact, McGann acknowledges that the theory of 
text articulated in his 1983 volume contributed to his decision to 
experiment with digital scholarship, but he believes that his in- 
troduction *to UNIX computing systems and to hypermedia" in 
the 1980s was equally as important. The emergence of new ideas 
regarding text and technology made McGann decide, “when cir- 
cumstances were right I would undertake building a computer- 
ized hypermedia model for scholarly editing."'? 

McGann found the appropriate circumstances in 1993 when 
the University of Virginia launched the Institute for Advanced 
Technology in the Humanities (IATH) under John Unsworth's 
directorship. Unsworth’s experience with the electronic jour- 
nal, Postmodern Culture, convinced him that the open access, 
web-based delivery of materials, rather than the proprietary 
stand-alone CD-ROM used by the majority of the contempo- 
rary digital edition projects, was most suitable for IATH’s digital 
projects. One cannot overstate how important Unsworth’s in- 
sistence on the web-based model would prove to the future of 
digital literary work, for without Unsworth's leadership we may 
well have continued to produce our digital projects on the CD- 
ROM, a form that has stymied digital textual editions production 
with issues including interoperability, speed of use, and limited 
storage size. McGann reminds us that Unsworth’s web-based ap- 
proach moved "against nearly every current in humanities com- 
puting scholarship, which was dominated by ‘stand alone’ ideas 
and technologies (epitomized in the early and short-sighted 
choice of CD-ROM as the venue for carrying humanities texts 
and hypertexts)."? The RA began as a small HTML prototype, 
what McGann labels “a kind of thought experiment."? Once the 
prototype was developed, the RA discussed expansion through a 
partnership with the University of Michigan Press. The project 
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North American Distributor for Amsterdam University Press, 
The Chinese Untversity Press, & B. G. Teubner Verlag. 
Worldwide Distributor for the Center for Japanese Studies and the 


Center for South and Southeast Asian Studies, University of Michigan. | Fig. 2.1. Univer. sity 
of Michigan Press 
Catalog, fall 1999. 


with the Press proceeded to the point where the website license 
for The Complete Writings and Pictures of Dante Gabriel Rossetti: 
A Hypermedia Research Archive was advertised in Michigan’s fall 
1999 catalogue and a site mock-up featured on the catalogue 
front page (see fig. 2.1). 

The publishing relationship disintegrated when McGann in- 
sisted that the site be open access and extensible, an untenable 
financial arrangement for the Press. The WWA has a similar his- 
tory. After an initial HTML prototype was developed, Price and 
his coeditor Ed Folsom worked with Primary Source Media to 
distribute a less robust version of selected materials as a CD col- 
lection titled Major Authors on CD-ROM: Walt Whitman."* Ulti- 
mately the WWA chose to proceed with a more robust, open ac- 
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cess version of the archive. With their insistence on open access 
web-based projects, the RA, WWA, and other projects of their ilk 
launched new standards for digital literary studies. 

As the digital archive form began to be replicated by other 
scholars, so too would the underlying theoretical model derived 
from new historicism. In 1993, when McGann launched his Ros- 
setti Archive, he coined the term digital archive to describe his 
work and, in doing so, rejected the textual studies linked term 
digital edition that had driven digital humanities work during the 
previous period." The archive offered possibilities that the book 
did not: “When a book is produced it literally closes its covers 
on itself,” but archives, in McGann’s mind, are “built so that its 
contents and its webwork of relations (both internal and exter- 
nal) can be indefinitely expanded and developed." The web of 
relations is crucial to the archive form and is derived in large 
part from new historicist conceptions of the archive. Working 
in reaction to perceived limitations of new criticism and post- 
structuralist criticism, new historicists centered their research 
within the physical archive. Marjorie Levinson aptly calls new 
historicism “a kind of systems analysis,"" a statement oddly pre- 
dictive of the way that computer technologies would be enacted 
in the digital archive and an emphasis on how archives became 
the sort of space in which the scholar would piece together tex- 
tual interrelations. If an intervention into an archive is a sort of 
systems analysis, the intervention is also, in reference to Der- 
rida's Archive Fever, a constructed and deconstructable entity. No 
archive can be “without outside.”!* Archival instability, a legacy 
of Derridian conceptions of power and truth, continues to in- 
form the way that digital literary scholars understand the work 
we undertake. Nowhere is this more apparent than in the 2009 
DHQ special cluster entitled “Done.”’? Underlying this special 
cluster is the insistence that digital works are highly mutable and 
perceptions of completeness are purely subjective. Brown et al. 
write, ^Doneness' circulates discursively within a complex and 
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evolving scholarly ecology where new modes of digital publica- 
tion are changing our conceptions of textuality, at the same time 
that models of publication, funding, and archiving are rapidly 
changing.” The multiple factors influencing the conception of 
completeness as well as the type of projects produced within the 
evolving parameters lend to the charge of instability. However, 
viewed within the new historicist legacy of Derridian archive, we 
might better understand why reading the archive through a com- 
plex set of power dynamics becomes more important than lock- 
ing down a set of protocols of production. 

The digital archive seems designed to meet the needs of the 
scholar interested in producing the “thick description” criticism 
central to new historicist work. Derived from anthropologist 
Clifford Geertz, thick description was the practice of “giving 
the act its place in a network of framing intentions and cultural 
meanings."?! Thick description necessitated the examination of a 
broad selection of materials including literature, political docu- 
ments, art, newspapers, material objects, and more, and new his- 
toricist scholars read materials from the Society for the Preven- 
tion of Premature Burial next to biblical texts and altar clothes.” 
Using “an empirically responsible investigation of the contem- 
porary meanings informing literary works (their parts, their pro- 
duction, their reception), as well as other social texts,” according 
to Marjorie Levinson, new historicists “regard these meanings 
as systematically interrelated within the period in question, but 
since we do not organize the system by a dynamic concept of 
ideology on the one hand, and of structural determination on 
the other, our inquires do not give rise to a meaningful historical 
sequence."? Levinson’s catalog of parts, production, and recep- 
tion harks back to book history approaches, but the emphasis on 
social texts as dynamic and indeterminate prepares the way for a 
world of bits and bites, of interrelated and indeterminate nodes 
of meaning. The digital environment would attempt to represent 
what Brook Thomas calls *a literary work's embeddedness within 
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a larger system of textuality" and could cross “the boundar- 
ies separating history, anthropology, art, politics, literature, and 
economics." Certainly McGann saw the RA as a “self-reflexive 
system," a “laboratory to study books,” and a reflection of so- 
cial text theory. Even critics such as W. Speed Hill recognized 
that the RA was a natural outgrowth of McGann’s theoretical 
approach: 


... If you forego the search for the single, authorially sanc- 
tioned, text-as-end-product-of-the-editorial-process, the 
logic of your position inexorably drives you beyond the co- 
dex and toward the archive. If meaning is dependent upon 
context, and contexts are multiple—indeed infinitive-only 
an infinitely extensible archive can contain the relevant 
data.” 


In the heady days of digital archive development, the web seemed 
to provide a natural test bed for McGann’s theoretical articula- 
tions and a new digital literary genre was born. 

While the digital environment would allow experimentation 
with new historicist ideas, new historicism also afforded the digi- 
tal authority. As discussed in the previous chapter, scholars were 
concerned that the digital space was seemingly unreliable, the 
antithesis of the peer reviewed, press-driven world of academic 
scholarship. How would those interested in producing digital 
materials resolve this problem? One of the means of shoring 
up the reputation of digital work was found in new historicism. 
“The practice of New Historicism," according to Sonja Laden,” 
is also authorized by the archive, or the library, as a more or less 
official repository of records: at once a site for storing a variety 
of material artifacts and documents and . . . a metaphorical site 
of ‘how people imagine what they know and what institutions 
validate that knowledge.”** The choice of the term archive, then, 
does more than describe the gathered materials. The mimesis of 
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the rare book archive, a form understood and given great value 
in scholarly circles, provides a means by which to bring value and 
authority to digital work. Influenced by Derrida’s conception of 
the archive, new historicists well understood the way that their 
use of archive both as authority and construction gave weight 
to the work they commenced. Derrida’s insistence that archives 
“inhabit this unusual place, this place of election where law and 
singularity intersect in privilege,” and "[a]t the intersectional of 
the topological and the nomological, of the place and the law, of 
the substrate and the authority, a scene of domiciliation becomes 
once visible and invisible.”’? Derrida’s conception of the archive 
as law and yet not may indeed point to why those working within 
new historicism were invested in digital archives. Such scholars 
understood that, like Derrida’s archive, the new Internet age and 
the transfer of texts to digital forms were acts of authority, of 
fixing, and of "gathering together" to give form, while the pro- 
cess of creating such slippery human knowledge within computer 
code was impossible, the very deconstruction that Derrida insists 
upon.?? 

Derrida's reading of the archive underpins the modeling of the 
text in digital archives. New historicism treated texts “as objects 
and events in the world, as a part of human life, society, the his- 
torical realities of power, authority, and resistance,”*! but, follow- 
ing Derrida’s line of thinking, new historicism rejects absolutes. 
This duality is reflected in the digital archive's textual representa- 
tion that rejects the sure codifications that were dominant in the 
earlier digital formulation of the edition. Eschewing genres that 
suggested power differentials within criticism, new historicist 
digital scholars rejected “literature” as the most valued term and 
flattened the categories “literature” and “historical document,” 
instead adopting the broader category “digital object.” While the 
term digital object is common in computer parlance (digital ob- 
ject architecture, digital object identifier), digital object is used 
far more loosely in digital literary studies, often slipping between 
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object, text, image, and document, a slippage seemingly related 
to the digital archive's new historicist theoretical roots. While 
the digital edition utilized the simulacrum of the scanned page 
or art image, hence the high-quality digital images contained in 
archives such as the RA or WWA, those constructing the digital 
archive did not fetishize the image as was true in the earlier digi- 
tal edition form. Instead, digital archives emphasized that *ob- 
ject” was a generalized concept much as new historicists used the 
term "text" to apply a lack of preference for form or genre or a 
baseline from which to begin criticism. Unsworth demonstrates 
his preference for the valueless digital object when he argues 
that *it is best if the authoritative name of a digital object has as 
little meaning as possible, and instead conveys the information 
we are tempted to load into the semantics of the name by some 
other means—for example, by breaking it out explicitly in differ- 
ent attribute values, or different database fields, or in some other 
way making it explicit rather than implied. The only thing one 
really wants a name to do, in short, is to distinguish this thing 
from other things, and so the only really required quality of a 
name—in the world of digital objects, at least—is uniqueness." 
Unsworth's concern is with the best way to process a large num- 
ber of related objects, but the emphasis on forcing a gap between 
meaning and object is crucial in this stage of digital production 
where the digital archive relegates the materiality of the text to 
an equal status as the social web of the texts, a very different po- 
sition than that found in the digital edition where the image is 
primary. 

'The rejection of an object's individuated ascribed power is 
greatly influenced by the work of Foucault. A key figure in new 
historicist theorization, Foucault is interested, in the words of 
Stuart Hall, in “the relationship between knowledge and power, 
and how power operated within what he called an institutional 
apparatus and its technologies (techniques)? ideas that form 
the foundation of how digital archivists understand the way by 
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which texts and technologies interact and build knowledge within 
the archive. To Foucault, “The apparatus is thus always inscribed 
in a play of power, but it is also always linked to certain coordi- 
nates of knowledge which issue from it but, to an equal degree, 
condition it. This is what the apparatus consists in: strategies of 
relations of forces supporting, and supported by, types of knowl- 
edge.”** Perhaps we should not be surprised that digital archives 
are imbued with Foucault's ideas of apparatus and his emphasis 
on techniques, roughly translated to technologies, though Fou- 
cault would not have imagined the impact of desktop computers 
when he was writing of such technologies. The digital archive 
balances an understanding of how the technologies, including 
the open access web, metadata, and interface, impact the under- 
standing of an object, hence McGanns chiding of Ed Folsom’s 
representation of the WWA in a PMLA issue focused on data- 
base.” McGann, echoing Foucault’s understanding of apparatus, 
argues that “no database can function without a user interface, 
and in the case of cultural materials the interface is an especially 
crucial element of these kinds of digital instruments. Interface 
embeds, implicitly and explicitly, many kinds of hierarchical and 
narrativized organizations.”* The insistence on reading the ar- 
chive and its power broadly is also a call to balance materiality 
with meaning. Margaret Ezell emphasizes that the digital archive 
model must represent the materiality of the object intertwined 
in a social matrix, in effect creating a balancing act that preserves 
textuality through a systems approach; “That the very materiality 
of women’s handwritten artifacts is its own system of textuality 
can easily be lost again in a system that privileges the linguistic." 
Certainly the move away from an overemphasis on materiality 
that occurs in the digital archive period is influenced by the work 
of Foucault and other new historicists who reject narrow repre- 
sentations of meaning and object. 

In 2005 NINES released Collex, the search and collection 
interface developed for its federated collection of nineteenth- 
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century materials. The interface is a tribute to McGanns5 belief 
that technology is intimately enmeshed with social understand- 
ings of the materials it represents, and, as such, Collex enacts the 
theoretical representation of new historicism's journey into digi- 
tal archive development. “A Ruby-On-Rails application,” Collex 
“allows users to search aggregated sites, collect, annotate, and tag 
the online objects they discover, and to repurpose those objects 
in illustrated, interlinked essays or exhibits.”** Collex allows us- 
ers to explore peer reviewed scholarly projects, selected special 
collections library catalogs, and related commercial collections 
and journals. Yet Collex is far more than a mechanism to search 
through the collected NINES materials as it “embodies interpre- 
tive acts"? and facilitates the new historicist understanding of 
objects within a social system. Collex treats each object as an 
individual item, which allows the user to theorize relationships 
among the objects and to remix the archive accordingly. As us- 
ers build relationships, Collex ingests the findings, recognizing 
"the contexts in which they «individual objects» are placed by 
a community of scholars." At the same time, Collex transpar- 
ently reveals the constructed nature of its network through its 
ever-shifting tags. lags are user-generated words that describe 
objects found in NINES, modeled on the crowd sourcing uses of 
word clouds in sites from flickr to Delicious. As users attach tags 
to individual digital objects, such as scholarly articles, poems, or 
paintings, the word cloud and the search are altered, reminding 
the user that the objects exist within and are impacted by a schol- 
arly community using the site (see fig. 2.2). 

The manipulation of the digital archive materials through the 
Collex tool is akin to the use of a box of papers found in a special 
collections room. Grouped by subject, patrons pull various pa- 
pers and objects from a box, ordering the items by interest and 
possible connectivity. As patrons finish with the materials they 
return them to the box in new constellations that represent in- 
terpretations. Unlike the earlier digital edition, where scholar- 
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Fig. 2.2. NINES Collex tags in a word cloud. 


creators worked to fix the original text to one meaning, digital 
archives reveal the social construction of meaning through inter- 
face, design, and use. 

'The lure of the rare book room echoes in the construction 
of digital archives from this period. At the heart of digital ar- 
chive construction is the desire for immersion in a wide range 
of textual materials, what Kenneth Price calls “the ideal of . . . 
all-inclusive resources for the study of given topics.”*! New his- 
toricist scholars who perceived the archive as the space of schol- 
arship viewed the digital as a tool for enacting the ideal of textual 
inclusion. For example, the WWA is premised on the problem of 
Whitman's manuscripts, "scattered in over sixty different insti- 
tutional repositories, and poetry manuscripts have been located 
in twenty-nine repositories."? The construction of an inclusive 
digital archive would return the manuscripts to a central loca- 
tion, the WWA, and, even better, allow for additional materi- 
als to interact with Whitman’s writing. Such concerns have led 
to the hallmark of digital archives, the ever-expanding archive. 
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Tbe Willa Catber Archive (WCA), for example, has increasingly 
expanded the interrelated materials housed on its site, evolving 
from publication of a scholarly edition to the inclusion of let- 
ters, images, audio and movie clips, interviews, speeches, public 
letters, and a geographic representation of Cather’s life. The or- 
ganic archive, however, could quickly become the unbounded, 
never-ending digital project. Like new historicist work, often de- 
scribed as scholarship without easily defined boundaries, digital 
archives tend to be porous rather than restrictive. The possibil- 
ity of ever-expanding materials, however, is a greater problem in 
the digital environment, where press conventions are not active, 
digital storage is capacious, and additions and corrections might 
continue indefinitely. Ed Folsom connects the expansiveness of 
the WWA to the huge, unfinished Whitman print project, the 
Collected Writing of Walt Whitman, noting that the editors of the 
Collected Writing "thought, just as Kenneth Price and I foolishly 
thought when we began the Walt Whitman Hypertext Archive, that 
they'd be done with the project in a few years."? What Folsom 
misunderstands, however, is that an edition of writings, whether 
print or digital, is far more contained than the idealized digital 
archive driven by the underlying principles of organicism and the 
unbounded understanding of the text. Developers of digital ar- 
chives have attempted methods of border control, including the 
most common method, a focus on a particular author, period, or 
geographic location, but most archives continue to struggle with 
material selection. 

Digital archivists were also impacted by the newly expanded 
canon, in part a result of new historicist work invested in revaluing 
texts for their historical and cultural impact rather than just liter- 
ary style. Such scholarship brought works by Charlotte Perkins 
Gilman, Kate Chopin, and Harriet Beecher Stowe, among many 
others, into a far broader canon than existed during the new criti- 
cal period, leaving digital archivists further concerned with how 
to delineate boundaries for archive construction. In fact, the im- 
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pact of canon expansion might be traced from print to digital in a 
number of early archives, including Stephen Railton’s Uncle Toms 
Cabin & American Culture. Jane 'Iompkins's 1985 book Sensational 
Designs successfully argued for the central position of Uncle Tom's 
Cabin in the nineteenth-century American canon. Re-situating 
sentimental fiction as an important form of literary production 
and one worthy of study, Tompkins drew upon contemporary 
feminist and new historicist work, arguing that “novels and sto- 
ries should be studied not because they manage to escape the 
limitations of their particular time and place, but because they 
offer powerful examples of the way a culture thinks about itself, 
articulating and proposing solutions for the problems that shape 
a particular historical moment.”* The importance of Uncle Tom's 
Cabin and similar sentimental fictional texts is not found in their 
literary value, claimed Tompkins, but in what the texts reveal 
about the culture from which they came. The job of the liter- 
ary critic, then, is to examine the multiple threads of history and 
culture in which the text is produced, providing deep readings 
that reveal new insight into the text. Rejecting the formalist read- 
ing of literature, “stylistic intricacy, psychological subtlety, epis- 
temological complexity,” Tompkins sets Uncle Tom's Cabin within 
a web, and here I use the term in all of its multiple meanings, of 
cultural and historical materials.? Stephen Railton’s Uncle Tom’s 
Cabin & American Culture reiterates the new historicist argument 
in digital form by situating Stowe’s text within a network of cul- 
tural and historical texts, digitally enacting Tompkins’s argument. 
Produced at the University of Virginia in partnership with the 
Electronic Text Center, Special Collections Alderman Library, 
IATH, and the Harriet Beecher Stowe Center, the site compiles 
cultural materials that illustrate the centrality of Stowe’s text to 
nineteenth-century culture. Called a “Multi-Media Archive,” the 
site includes various versions of Uncle Tom novels, images, songs, 
tracts, critical essays, reviews, adapted plays, songs, and movies. 
Organized around responses to and adaptations of the novel, 
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the archive represents a place in which a scholar might conduct 
broad and deep historical and cultural research, the hallmark of 
the digital archive moment. While Railton does not identify as a 
new historicist, there is every indication that new historicism in- 
forms the materials selection and that the archive argues for the 
importance of Uncle Tom's Cabin, the novel, using the same crite- 
ria as Tompkins’s scholarly work. Where Tompkins’s scholarship 
points to the centrality of the sentimental novel, Railton's archive 
includes a number of pretexts from sentimental culture. Where 
Tompkins’s scholarship points to the centrality of religious piety, 
Railton’s archive includes Christian texts. Railton even extends 
Tompkins’s argument for the cultural standing of Uncle Toms 
Cabin by including multiple responses to the text, such as reviews 
and adaptations, and by creating an archive where the selected 
materials show the significance of the novel within Stowe’s con- 
temporary culture. 

No matter that the idealized digital archive was expansive, the 
signature style of new historicism, the anecdote, reinforced the 
continued centrality of the individual item. Peppered through- 
out new historicist scholarship, the anecdote “is drawn from di- 
verse archival disciplines . . . [and is] placed alongside authorial 
‘culturally sanctioned’ literary texts.”*° Anecdotes range from a 
painting read against a Shakespeare play to a bawdy broadside 
illuminating a religious text, with the anecdote reinforcing the 
new historicist belief in *the value of the single voice, the isolated 
scandal, the idiosyncratic vision, the transient sketch." How- 
ever, the new historicist commitment to viewing the object or 
the anecdote within a larger connective social web created an 
unresolved tension between the individual and the collective that 
would be replicated in digital archives. Scholars developing digi- 
tal archives idealized organic and expansive datasets but faced 
limitations including economics, scale, and form that have lim- 
ited the produced archives to the point where the final product 
might be read as a collection of anecdotes. Instead of seeing the 
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contradiction as insurmountable, scholars such as Alan Liu view 
the anecdote as a form of “random access. If the new historicism 
is a kind of relational database, then the anecdote is its query." 
Anecdotes might be seen as the way into history, as “the random 
anecdote’s interior contradiction, irony, or aporia . . . exposes the 
fault lines in the ‘reality’ of history itself.” In its digital version, 
“New Historicism is the intuition simultaneously of random ac- 
cess (an atheist transcendent) and determination (a bowing down 
or conviction)."? Though the anecdote might appear limiting, 
even retrograde, Liu’s analysis situates it as a wedge through 
which to view the moments that do not fit, the space in which 
scholars might successfully probe for new findings. 

Perhaps the most enduring legacy of new historicism is digital 
literary studies’ rejection of the innate meaning of both individual 
objects and the structures in which they function. New histori- 
cists scrutinized form and materials, studied what was in and out 
of the archive, and questioned the power relations formed by the 
construction of and use of the archive. If new historicism is dis- 
tinguished “by its lack of faith in ‘objectivity’ and ‘permanence’ 
and its stress not upon the direct recreation of the past, but rather 
the processes by which the past is constructed or invented, "?? so 
too are digital archives characterized by the same. Scholars who 
engage in digital archive creation also refused to view the archive 
and the technology used to create it as naturalized or organic 
and remained self-reflexive about all aspects of the archive, from 
materials selection, to metadata, and to interface design. Writ- 
ing about The Women Writers Project, Margaret Ezell underscores 
that “how archives are put together, maintained, and accessed 
affects the stories that are told about them and the stories that 
their contents tell."*! Technologies used to create archives, such 
as TEI/XML, the de facto metadata standard in digital literary 
studies, have been carefully dissected.’ New historicist insistence 
on social construction bleeds into the way scholars understand 
metadata and the encoding of texts. Martha Nell Smith is clear 


56 * TRACES OF THE OLD, USES OF THE NEW 


that the TEI is not a naturalized computer code but a socially 
produced form impacted by cultural and social values; ^Meet- 
ing the challenge [of encoding texts] requires asking in as many 
ways as can be imagined how to go about reckoning the hier- 
archies apparently accepted by coding for rendering the images 
and texts in digital format, hierarchies that stubbornly resist par- 
ity between the intellectual/textual object and physical object but 
that insist one must be subordinate to the other"? We should not 
underestimate the importance of such an understanding of self- 
reflexivity, as it provides a methodological means for digital hu- 
manities to focus on a broad set of theoretical engagements from 
critical code studies to datamining. The notion of a scholar's pur- 
view as broad and diverse, interconnected and social rather than 
limited to a particular author or literary text, crucially influences 
the direction of digital literary studies. As digital archivists began 
to interrogate code and connectivity they were implicitly arguing 
for an expansion of study akin to the interdisciplinary interests of 
new historicism, or the big tent of digital humanities. 

As this analysis demonstrates, the impact of new historicism 
on the theoretical and structural conceptualization of the digital 
archive is foundational to the way that literary digital humani- 
ties has evolved. However, one crucial, muted issue that has deep 
roots in the new historicist past continues to plague the evolving 
field—the growing hostility between literature and history schol- 
ars working within the digital environment. David Parry's post 
on his AcademHack blog provides one example of this tension: 


Digital Historians have leveraged the digital to expand 
and engage a wider public in the work of history. As ex- 
amples of this think of Omeka, or leveraging social me- 
dia to engage in crowd sourced projects. That is, Digi- 
tal Historians have often begun by asking *how does the 
digital allow us to reach a larger/public audience?" Now 
this could be because many of the folks working in Digital 
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History come from a public history background ... But in 
the case of literary studies the “digital” projects have not, 
as much, changed the scope of the audience. So that if you 
look at digital literary projects they often look remarkably 
similar to projects in the pre-digital era, just ones which 
have been put on steroids and run thru a computational 
process. Seems to me that the Digital Historian model is 
a better one.** 


We might dismiss Parry’s post as symptomatic of academic Bal- 
kanism, yet the expressed division between literature and history 
scholars is not an isolated incident. Historian Edward Ayers has 
stated, “The irony is that history may be better suited to digital 
technology than any other humanistic discipline." ‘To historians 
like Dan Cohen, digital work is most appropriate for history be- 
cause of scholarly approach. Cohen posits, 


We need to recognize that the digital humanities repre- 
sent a scary, rule-breaking, swashbuckling movement for 
many historians and other scholars. We must remember 
that these scholars have had—for generations and still in 
today’s graduate schools—a very clear path for how they 
do their work, publish, and get rewarded. Visit archive; do 
careful reading; find examples in documents; conceptualize 
and analyze; write monograph; get tenure. We threaten all 
of this. For every time we focus on text mining and pattern 
recognition, traditionalists can point to the successes of 
close reading—on the power of a single word. We propose 
new methods of research when the old ones don’t seem 
broken.” 


On the one hand, Cohen’s representation of the threat of digital 
humanities rings true. New approaches to research create tension 
within scholarly fields, particularly as scholars must learn new 
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techniques. Situating the archive as a central trope in the battle 
over digital humanities, Cohen suggests that the close reading 
approach, “the power of a single word,” is a traditionalist meth- 
odology not appropriate for the new digital form.” However, 
as the previous discussion of both the archive and the edition 
reveals, close attention to individual objects, such as the word, 
remains a guiding principle in digital literary studies. While the 
data driven model has much to offer scholarship, those that come 
to the digital from a new historicist background are suspicious 
of a monolithic use of data, not because it is data but because the 
individual moment, which can become occluded in such a data 
dump, has the power to rewrite and disrupt the larger narrative 
only when it is located by a scholar through an interpretive mo- 
ment. To digital literary scholars, the uniqueness or individuality 
of an object needs to be understood as potentially disrupting the 
larger context, rather than a piece that is always viewed within 
a continuous narrative. Here Alan Liu’s understanding of the 
anecdote as random access in a database is illuminating, as the 
anecdote allows disruption through “the single word,” the very 
object that Cohen dismisses. The dangerous trend toward the 
homogenization of digital work in the supposed interdisciplin- 
ary digital humanities masks the methodological concerns that 
promote such differences and needs to be resisted as the broader 
digital humanities develops. 

If the digital humanities are interdisciplinary, why the con- 
tinued conflicts between history and literature? The history of 
interdisciplinarity in the academy provides one means of read- 
ing the dislocation. Joe Moran’s Interdisciplinarity traces the rise 
of disciplines from Aristotle through current academic structures 
and contends that, “Broadly speaking, the development of disci- 
plines has not merely created self-contained bodies of knowledge, 
happy in their isolation; it has been accompanied by frequent at- 
tempts to assert the superiority of certain fields of learning over 
others."** Moran’s “self-contained bodies" that house interdisci- 
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plinary thought provide a means of understanding the fraught 
relationship of history and literature. If digital interdisciplinarity 
is based on traditional assumptions of multi-disciplinary knowl- 
edge housed within one body, then hierarchical tensions surely 
will occur. Far more productive is the emphasis on interdisciplin- 
arity built through collaboration. It is also likely that the struggle 
between history and literature might be more fraught than other 
disciplines because of the two disciplines’ historical emergence: 


The two subjects were sometimes taught together in early 
degrees at dissenting colleges in the nineteenth century, 
and they developed as fully-fledged academic subjects at 
around the same time. Each of these disciplines contains 
elements of the other: literary studies often draws on his- 
torical material, while everything, including literature, 
could be said to have a history. The obvious connections 
between the subjects, however, have not always encouraged 
co-operation; they have often led to greater territoriality, as 
each subject has sought to consolidate its own separateness 
and uniqueness.?? 


The tensions between history and literature are nothing new, ac- 
cording to Moran, yet certain periods heighten the anxiety. New 
historicism was one such moment where clashes between litera- 
ture and history peaked, and the new age of digital archive work 
seems to have reignited the battle. 

Yet we need to recognize that the reason disciplinary tension 
appears in contemporary discussions is because disciplines still 
matter. Disciplines govern our academic lives, from our gradu- 
ate training, to our position in the university, to the type of work 
valued, to our ability to advance in our careers. Universities con- 
tinue to organize knowledge groups into traditional subject areas, 
so it should come as no surprise that we find it difficult to work 
outside our traditional structures. If we agree that one scholar 
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cannot move intellectually across all disciplinary fields, then we 
must learn how to use the generalized umbrella term of digital 
humanities to bring together scholars from multiple fields. One 
of the strengths of the broad digital humanities is its ability to 
pull together diversely trained scholars, as “there may be human 
intellectual limits to interdisciplinarity: given that most research 
in the humanities is undertaken by scholars working on their 
own, it may be difficult for these people to become conversant in 
the theories, methods and materials of two or more disciplines, 
without producing significant gaps in their knowledge.” We 
pretend that disciplinary boundaries have ceased to exist at our 
peril. Instead of shying away from such complexities, we should 
embrace the heady dissention. While the old/new historicist split 
created controversy rather than interdisciplinary cooperation, 
it also generated intellectual stimulation and fertile ground for 
experimentation. This tension between fields could prove fruit- 
ful for digital scholarship, generating new intellectual questions. 
Moran cautions that *[m]any of the examples of interdisciplinar- 
ity I have examined so far represent a kind of nostalgia for the 
lost unity of knowledge, and they see the discipline of English 
as the best way of restoring this."^' By shifting interdisciplinarity 
out of one body to many bodies, by rejecting a coherence of ap- 
proach, we may move away from the seduction of unity. To model 
interdisciplinarity, we must reject calls to lock down digital hu- 
manities to a particular methodology, instead privileging a broad 
range of approaches. 

The archival turn in American digital literary work has cre- 
ated a theoretical foundation of use and analysis that under- 
pins our current work. The construction of digital archives has 
brought cohesion to the field and allowed for the development of 
standardized approaches to the work that literary digital schol- 
ars are interested in undertaking. Yet, we need to remember that 
new historicism has taught us that the archive is our construc- 
tion. Without careful attention to the way in which the archive 
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structure itself represents our theoretical approach we have done 
a disservice to the possibility of digital work as more than just 
technique or application. As Louis Montrose wrote during the 
height of new historicism: “Inhabiting the discursive spaces tra- 
versed by the term ‘New Historicism’ are some of the most com- 
plex, persistent, and unsettling of the problems that professors 
of literature attempt variously to confront or to evade." The 
scholarly complexity articulated by Montrose remains persistent 
and valid in contemporary digital studies. As Caroline Steedman 
reminds us of physical archives, “The Archive is not potentially 
made up of everything, as is human memory; and it is not the fath- 
omless and timeless place in which nothing goes away that is the 
unconscious. The Archive is made from selected and consciously 
chosen documentation from the past and also from the mad frag- 
mentations that no one intended to preserve and that just ended 
up there."? As our new historicist forerunners have taught, the 
digital archive has the potential to disrupt and fragment, even as 
it provides coherence and the most productive scholarship works 
within these gaps. 

The central position maintained by digital archives in the late 
1996s and early 2000s is diminishing. Fewer digital projects are 
adopting the term archive, and scholars are turning their atten- 
tiveness to datamining and geospatial representations. Yet the 
archive remains an important form that those working within 
the field must engage, as it strengthened the reputation of digital 
scholarship and laid the theoretical foundation for current work. 
We will continue to see projects that are best developed in the 
digital archive form, much as we continue to see digital editions. 
Yet there are new trends in digital literary studies that must also 
be tested and examined as we move forward. 


CHAPTER 3 


What’s In and What’s Out? 


Digital Canon Cautions 


Cyberspace is an environment comprised entirely of o’s and 
1’s: simple binary switches that are either off or on. No in- 
between. No halfway. No shades of gray. All too often, when 
it comes to virtual culture, the subject of race seems to be one 
of those binary switches: either it's completely “off” (i.e., race 
is an invisible concept because it's simultaneously unmarked 
and undiscussed), or it's completely “on” (i.e., it's a contro- 
versial flashpoint for angry debate and overheated rhetoric). 
While there are similar patterns of silence about race when 
it comes to interpersonal interaction in “the real world,” the 
presence of visual and aural markers of race (no matter how 
inaccurate those may be) means that race is rarely (if ever) as 
invisible offline as it is in cyberspace. 


—Beth E. Kolko, Lisa Nakamura, and Gilbert B. Rodman, 
Introduction to Race in Cyberspace, Routledge, 2000, 1 


In 2010, I located Sharon Harris’s Early American Women Writers 


website. By 2011, the site had been removed. 


As the previous chapters chronicle, scholars working in the early 
period of digital literary production experimented with delivery, 
forms of access, interfaces, and representations of materiality. 
The digital edition and digital archive models were successfully 
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replicated leading to a period of rapid digital production. Work- 
ing within these models, a subset of scholars focused on what I 
dub *digital recovery projects," archives and editions that used 
digitization to expand what such scholars saw as an outmoded 
new critical literary canon that excluded work by women, people 
of color, queers, and others. Digital recovery projects emerged 
out of activist cultural studies communities and were tied to 
evolving understandings of the emergent Internet. 

In the 1970s and 1980s, canon expansion was the literary holy 
grail. Groundbreaking scholarship, such as The Madwoman in the 
Attic by Gilbert and Gubar, argued for a broader canon, herald- 
ing a new generation of scholars that would begin to edit and 
publish previously excluded writers.’ But scholarly presses had 
limited budgets and numerous texts remained unpublished or 
out of print, rejected because they were deemed noncommer- 
cially viable. Further, scholars of African American, Asian Ameri- 
can, Native American, and Latino/a literatures charged that the 
white feminist movement had not sufficiently attended to writers 
of color. Among the most influential texts that attracted criticism 
was the Gilbert and Gubar Norton Anthology of Literature by Women 
(1985), condemned for its lack of women writers of color, diver- 
gent classes, and pre-1800 women’s writing.’ Scholars charged 
that the Norton Anthology had created an alternative canon that 
was as problematic as the previous canon that had excluded wom- 
en’s writing. Margaret Ezell’s influential Writing Women’s Liter- 
ary History (1993), for example, articulates how “structures used 
to shape our narrative of women’s literary history may have un- 
consciously continued the existence of the restrictive ideologies 
that initially erased the vast majority of women’s writings from 
literary history and teaching texts." One remedy to what was 
seen as a continued exclusionary canon was the launch of activist 
presses such as the Feminist Press (1970) and the Kitchen ‘Table: 
Women of Color Press (1980), dedicated to publishing materials 
that mainstream and even scholarly presses were not printing. 
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Despite valiant efforts by activist presses and the increased publi- 
cation of “lost” texts by mainstream and scholarly presses, it was 
clear that such efforts alone would not solve the canonical eli- 
sions. Dissatisfied, scholars and activists began to look for other 
solutions to solve the canon problem. 

In the 1990s, proponents of the Internet began to tout the 
developing technological infrastructure as a tool to democratize 
knowledge. Popular culture portrayed the early 1990s Internet as 
an idealized, democratic, and free space that needed protection 
from the corporate market forces intent upon invading the open 
space theretofore dominated by scientists, hackers, and geeks. 
Advocates of the free web championed three ideas: ^1) Access 
to computers should be unlimited and total; 2) All information 
should be free; (and) 3) Mistrust authority and promote decen- 
tralization,” all ideas designed to allow “bubbles” of information 
to rise from the bottom, sowing “seeds of revolutionary change." 
Scholars, too, began to understand the net as a space that altered 
power structures. As Paul Delany reports, 


The Internet has thus mutated into an unforeseen and un- 
planned information space. Its virtues can all be attributed 
to its collegial political economy: in a word, its openness. 
Internets most important features are its relatively small 
hardware investment, a weak (but not ineffective) central 
administration, little censorship, and an absence of specifi- 
able “bottom-line” objectives. Its explosive growth in the 
last few years confirms the dynamism of a collegial cyber- 
space culture in which millions of users exchange informa- 
tion, collaborate on creative projects, and have their say on 
any subject they care about. 


For scholars interested in challenging the traditional canon, the 
technological possibilities were a boon. It was imagined that the 
web would allow those previously cut off from intellectual capi- 
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tal to gain knowledge that might be leveraged to change their 
social position and would allow those who had been silenced to 
have a voice. Hypertext theorist Jay David Bolter, for example, 
promoted the freeing power of the web-based environment as a 
space that encouraged “the abandonment of the ideal of high cul- 
ture (literature, music, the fine arts) as a unifying force. If there 
is no single culture, but only a network of interest groups, then 
there is no single favored literature or music."^ T he belief in the 
low cost of Internet-based publication would encourage scholars 
to embrace the Internet as a vehicle for disseminating cultural 
materials. In 1996, Peter Shillingsburg anticipated that it “even- 
tually will cost less to produce and therefore, one assumes, to 
purchase a compact disk than it cost to produce and purchase 
Hans Gabler’s edition of Ulysses." In hindsight, Shillingsburg 
and the larger scholarly community have rejected this view as 
naive and unrealistic, yet it was a common refrain in the early, 
heady days of digital recovery, where the faith in the Internet's 
ability to shift hierarchical structures became part of the narra- 
tive that in no small part drove the proliferation of digital literary 
recovery projects. 

From the mid-199os through the mid-2000s there was an 
explosion of digital sites that championed work previously ex- 
cluded from the canon, what I label digital literary recovery 
projects. Scholars worked individually or as small collectives 
to produce bibliographies or to publish primary texts. Simple 
HTML projects, such as The 19tb Century American Women Writ- 
ers Web (1g CWWW), Voices from the Gaps, Early American Women 
Writers, The Black Poetry Page, The Online Archive of Nineteentb- 
Century U.S. Women's Writings, and American Women Writers 1890 
to 1939—Modernism and Mythology,’ were developed by scholars 
without the support of a digital humanities center, technological 
collaborators, or external funding. The activist recovery projects 
of this period were positioned to critique and respond to per- 
ceived weaknesses in the existing canon. Projects during this pe- 
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riod were created using HTML, HyperText Markup Language, 
in large part because HTML was a fairly simple to learn tag- 
ging system that became even easier to use with the development 
of early HTML editors, including Claris Home Page, Mozilla 
Composer, and Adobe PageMill, all of which allowed novices to 
easily publish their findings. Sites built during this period were 
largely labors of love launched by scholars interested in dissemi- 
nating materials that they found buried in difficult to access rare 
book rooms, crumbling newspapers, and unknown journals. 
Such work envisages digital literary scholarship as a tool that 
might be utilized to meet the theoretical demands of scholarly 
work that reinserted women, queers, and people of color into 
the canon. For example, groundbreaking feminist scholar Judith 
Fetterley, whose 1978 The Resisting Reader: A Feminist Approach 
to American Fiction did much to challenge the traditional literary 
canon, nurtured a small online digital project to bring scholarly 
attention to the huge numbers of nineteenth-century women’s 
texts that deserve critical scrutiny. Fetterley’s personal website at 
the University of Albany, from which she is now retired, includes 
several pages of bibliographies and chronologies of what were 
then little known literary texts by American women." Titled the 
19tb-Century Bibliography Project, the site includes bibliography 
entries focused on women writers from the 1820s and 18305 ar- 
ranged chronologically and alphabetically (see fig. 3.1). 
Developed by students in a graduate course that Fetterley 
taught in 1995 and then compiled and published on the web 
by her student, Annie (formerly Lois Dellert) Raskin, Fetterley 
viewed the project as both pedagogical and activist. Articulating 
similar concerns about the canon as Ezell, Fetterley explains that: 


Underlying this project was the desire to make clear to stu- 
dents that the texts available for such a course as this rep- 
resent only a fragment of the total work of women writers 
during the period, and that we are as much in danger of 
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Chronological listing of bibliography entries: 
1830-1835 


Bullard, Anne Tuttle Jones. The Stanwood Family; or, The History of the American Tract Society. 
Farrar, Eliza Ware Rotch. The Children's Robinson Crusoe. 

Gould, Hannah Flagg. Mary Dow, and the Lisle Beggar Girl. 

Hale, Sarah Josepha Buell. Conversations on the Burman Mission. 

— Poems for our children: designed for families, Sabbath schools, and infant schools. Part 1. 

Hinckley, Mary. The Seymour Family; or, Domestic Scenes. 

— — Sequel to the Seymour Family; or, Domestic Scenes. 

Larned, Mrs L. The Sanfords; or, Home Scenes. 

— — Grace Seymour. 

Robbins, Eliza. Elements of mythology. 

Royall, Anne Newport. Letters from Alabama, 1817-1822. 

— —— Mrs Royal's Southern Tour; or, Second Series of The Black Book. 3 vols. 

Sedgwick, Catharine Maria. Clarence; or, A tale of our own times. 

Sedgwick, Elizabeth Buckminster Dwight. A Spanish Conquest of America. Designed for the use of children. Bostoa: Leonard C.Bowles, 1830. 
Sedgwick, Susan Ann Livingston Ridley. The children's week. 

— The young emigrants: A tale designed for young persons. 


Talhot. Mary Flrsheth: Ruralire: Orivine! D'exulwv Toles 


Fig. 3.1. Chronological listing of bibliography entries: 1830-1835. 19th- 
Century Bibliography Project. Judith Fetterley and Annie Raskin. 


developing a skewed version of American literary history 
if we look only at the texts by women that have been re- 
printed in the last twenty years as we are if we look only 
at the work of male writers. In other words, the category 
of the “marginalized” can seem quite privileged when set 
in the context of a category that might well be called *the 
not-yet-marginalized."!! 


The online bibliography was a means of enacting a broader 
canon than that represented in print and is also indicative of the 
many such early digital projects where self-publication was de- 
signed to distribute work in progress, expand the literary and his- 
torical record, and spur new interest in the recovered texts. Un- 
like a print bibliography, which purports to be an authoritative, 
complete, and static overview of a particular author or topic, the 
19tb-Century Bibliography Project is understood to be incomplete, 
in effect "an introduction to the various bibliographic resources 
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available for research in this field.""? The project also views the 
digital as a means to initiate collective action and, accordingly, 
invites revisions: “We offer these bibliographies in the hope that 
they will be useful to others and we welcome comments, cor- 
rections and discussion."? The collaborative approach to de- 
veloping and sharing the materials, with Fetterley utilizing the 
classroom as a textual cottage industry, is typical of this period, 
and other projects such as The Charles Chesnutt Archive also fol- 
lows the pedagogical model to bring "lost" texts online. Unfortu- 
nately, though, the 19th-Century Bibliography Project is one of the 
many digital projects built during this period that are in danger 
of disappearance. The project is fragile not because of its digital 
form but because there is no preservation strategy nor commu- 
nity surrounding the project. While participation and revision is 
invited, there is no indication that the materials found in the bib- 
liography had more than perhaps a revision or two. Published in 
1996, the pages were last updated in 1997 and are now static and 
even decaying, an emblem of the larger state of digital recovery 
projects from this period. 

In hindsight the limitations of recovery projects are clear, but 
for scholars interested in reworking the canon in the late 1990s, 
the web seemed a space of possibility. Scholars believed that the 
digital environment prevented presses and editors from limit- 
ing the types of work published. ^What is new in the twenty- 
first century," writes Susan Fraiman, "is that now the guest list 
of history-making women is electronic—and there are always 
more seats at the table.”'* The belief in the Internet as a means 
to break the canon was likewise espoused by the editors of Ro- 
mantic Circles: “One of the strengths of Web publishing is that it 
facilitates—even favors—the production of editions of texts and 
resources of so-called non-canonical authors and works." Like 
Fraiman, the editors of Romantic Circles laud web publishing's 
ability to allow the scholar to work outside traditional publication 
mechanisms and to create expansive canons. The editors go on to 
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identify the simplicity of digital publication as a key component 
of the success of the web. Success "is in part a function of the 
relative simplicity of HTML (and all of the simpler document- 
type-descriptions of SGML) and of ‘workstation publishing’ in 
general when compared to traditional commercial or academic 
letterpress production and distribution methods."!^ The ease of 
publication identified by the editors was what allowed the broad 
range of small-scale recovery projects to explode in this early pe- 
riod, as most of the projects were produced using simple hand 
coding of HTML or HTML editing programs. Digital recovery 
project URLs reveal that creators of projects often "published" 
their materials on personal webspace allotted by colleges and 
universities to their faculty and students. Technologically inex- 
perienced users could create a simple textual project using low 
cost and low expertise methods, an approach contradictory to 
much of contemporary digital project development that tends to 
be conducted with the more complicated TEI/XML, databases, 
tools, and even datamining and algorithmic approaches. The low 
threshold at this moment of digital literary studies resulted in a 
broader set of textual materials being created and a democratic 
digital development. 

‘Tracking the rise of the queercore, riot grrrl, and straight edge 
punk movements in the late 1980s and early 1990s provides one 
lens through which to read the digital literary studies movement. 
Emerging from 1970s punk, queercore, riot grrl, and straight 
edge punk movements adopted self-publication as activist in- 
terventions. Amy Spencer describes the riot grrl movement as 
"about using anything you can get your hands on to shape your 
own cultural entity; your own version of whatever is missing in 
mainstream culture," an ideology replicated in digital recovery 
projects from the mid to late 1990s. As did those interested in 
the digital recovery of texts, activist oriented music refused to 
rely on traditional methods of artistic dispersal, instead creating 
a self-publishing movement that allowed participants to create 
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a broader canon of artistic expression. Riot grrl self-publication 
gave voice to the excluded: 


BECAUSE us girls crave records and books and fanzines 
that speak to US that WE feel included in and can under- 
stand in our own ways. BECAUSE we wanna make it easier 
for girls to see/hear each other's work so that we can share 
strategies and criticize-applaud each other. BECAUSE we 
must take over the means of production in order to create 
our own moanings. BECAUSE viewing our work as being 
connected to our girlfriends-politics-real lives is essential if 
we are gonna figure out how we are doing impacts, reflects, 
perpetuates, or DISRUPTS the status quo.'* 


Embracing a personal is political approach, both groups made vis- 
ible excluded women, people of color, and queers—whether the 
exclusion was at a punk show, in print culture, in the classroom, 
or on a music label. The notion of such work as “underground” 
was vitally important, as the groups positioned their work against 
that of the dominant society. If mainstream record companies 
wouldn’t sign queer singers, then singers would produce and dis- 
tribute their work directly to fans. If scholarly publishers refused 
to print a relocated women’s text, then scholars would create and 
publish the materials on the web for use by those interested. 
Often localized and idiosyncratic, both movements were able 
to use new models of technology to promote inclusion and share 
understandings of voice and power through do-it-yourself (DIY) 
production of materials. Moore and Mitchel argue, “We see the 
DIY mechanism as both a relational mechanism because it both 
‘alters connections between people, groups, and networks,’ which 
creates solidarities, and as a ‘cognitive mechanism,’ because it 
‘operates through alterations of individual and collective per- 
ception’ by providing a foundation for the creation of imagined 
communities via taste and aesthetic choice.”!’ A hallmark of the 
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DIY literary recovery projects is the development of a virtual 
community through various linking strategies. Projects often 
referenced other like-minded recovery projects through a list of 
links, creating an ever-expanding virtual network. For example, 
the Native Web included a list of authors, writers, and biographies 
(see fig. 3.2). 

Native Web is activist in construction, community-focused in 
orientation, with a primary goal *to foster communication among 
peoples engaged in the present,"? a common thread in such digi- 
tal recovery work. The importance of what I call “curated hyper- 
linked" sites has been unremarked upon by digital humanities 
scholars, a remnant of late 1990s web culture that now seem sim- 
plistic and out of date. Yet such work was pivotal to the formation 
of digital literary culture. Influential curated hyperlinked sites of 
the period include Alan Liu's The Voice of the Shuttle and Randy 
Bass’s The American Studies Crossroads Project. ^ Donna Camp- 
bell’s American Writers project provides an excellent example of 
curated hyperlinked projects used to construct a community of 
scholars interested in revising canons” (see figs. 3.2 and 3.3). 

Launched in 1997, Campbell created the website to support 
a literature class. Housed on her personal webspace at Gonzaga 
University, the site grouped materials by author, time period, and 
literary movement. Historical, cultural, and biographical materi- 
als are also included as are links to primary texts.? Campbell’s cu- 
rated hyperlink site and others like it were developed to renego- 
tiate canonical boundaries. Hence, the American Writers author 
list is far more diverse than contemporary anthologies, including 
little studied women, Native American, Latino/a, African Ameri- 
can, and Asian American writers. Rather than devaluing such sites 
as technologically simplistic, we must resituate curated hypertext 
sites as crucial genres in the evolution of digital literary studies. 

Open access, the use of digital technologies to distribute 
scholarly materials without cost or restriction, is also forecast in 
the 1990s digital recovery movement. For example, Kim Wells’s 


Nation Location 


A Man of Spirit Blackfoot US-Southeast 26 
Native American Author David Biser has a new release entitled "A Man of Spirit” A contemporary work based on a 
true event where the past and present co-exist. 


He has now completed his masters and is teaching a creative writing course at UBC. Take a look at his work. 


Cal Thunder Hawk Lakota United States — 1152 


Amusing site that features "Native American Quiz for Skins"; "The Amazing Anti-Pig Farm Rantomatic Email 
Gonersin’ amal vus rant in urar renibus Sirun trike cruincil officer itinn the inem on the cite’ «ameles nd Cale 


Fig. 3.2. Native Web screenshot, Authors, Writers & Biographies. 


Herve | Literary Movements | Timeine | Aneren Authors | Amescan Liane Stes 


Gertrude Simmons Bonnin (Zitkala-Sa) (1876-1938) 


—— Ziiksla-Sa: Secondary Bibliography 


Zfkaia-Sa Biography and links at the Native American Authors Project 
Biography by Roseanne Hoefel and bibliography at the The Oniine Archive of Ninoteenth-Century U S. Women's Writings. erage covery of ma was ) 
Biographical sketch by Melessa Rae Henderson at the Voices from the Gaps: American Women of Color site. (New URL) 


Ken Harzog's general commactary on leaching ZBete-Be fom ihe Heath Anthology sil. 
General Bibliography of Native American 


B 


Works Available Online 
Page images from the Affantic Monthly s 
* Impressions of an Indian Childhood (January 1900) 
* School Days of an Indian Girl (February 1900) 
* An Indian Teacher among Indians (March 1900) 
An indian Teacher Among Indians Atlantic Monthty (1900), 


Volume 85. 
Impressions of an Indian Childhood Atlantic Monthly 85 (1900)- 37-47. 
Old Indian Legends (1901) 


Soft Hearted Sioux Monthly , New York (1901 ) 
The Trial Path Harper's Monthly, Volume 103, October 1901 
"A Warriors (1902) 


Daughter" 
"Why | Am a Pagan” by Zitkala Sa Atlantic Monthly 90 (1902): 801-803. 


Cmn 3) Go 


Fig. 3.3. Zitkala Sa page, American Writers curated hyperlinked site. 
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Domestic Goddess website, which focused on canonically excluded 
women writers,” positioned open access and education as central 
goals of digital recovery projects: 


I think it is our duty as teachers not to ignore the possibili- 
ties of making research easily available on the Internet. If 
educators do not provide the information, who will? Do 
students have to pay for it, as a lot of encyclopedia com- 
panies are requiring now? This makes information only 
commercial, and even if they buy a subscription, sometimes 
the information they get is incomplete, and encyclopedic, 
rather than critical. These sites are great—but J want to 
teach for free.”° 


The desire to educate and freely disperse information is appar- 
ent in other early sites, including Kristin Mapel-Bloomberg's 
American Women Writers 1890-1939, Modernism and Mythology, 
and Shari Benstock’s Women of the Left Bank, all of which publish 
supplemental scholarly materials to their accompanying print 
books.?6 Such work, tied to DIY activist movements that bled into 
scholarship, is among the earliest representation of open source 
scholarly publishing and, like curated hyperlinked texts, needs 
to be reconfigured within the history of digital literary stud- 
ies. Though often dismissed as simplistic or unimportant, open 
source distribution of materials brought attention to writers that 
had not been represented in the canon and helped to launch our 
current understanding of public scholarship. 

As individual scholars began to share recovered texts online, 
libraries began to explore how they might use the digital to bring 
their collections to a larger audience. Etext centers were started 
at Rutgers University and Princeton University (Center for Elec- 
tronic Texts in the Humanities [CETH]), the University Library 
of the University of North Carolina at Chapel Hill (Documenting 
the American South), the University of Virginia (The Electronic 
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Text Center [Etext]), and other universities. 7’ Some of the cen- 
ters emphasized out-of-print and recovery texts, as was the case 
with the University of North Carolina's Documenting tbe Ameri- 
can Soutb, but most were focused on the digitization of canoni- 
cal texts. The UVA Etext Center is arguably the most important 
center during this period due to the quality and quantity of the 
projects they produced,” as well as the students trained within 
the center who include notable contemporary digital scholars 
Stephen Ramsay, David Gants, Lisa Spiro, Tanya Clement, Mat- 
thew Kirschenbaum, and Amanda French. The Center digitized 
a broad assortment of texts, recovered and traditional; noncanon- 
ical texts by Native American, African Americans, and women 
writers coexisted with Founding Father documents by Thomas 
Jefferson and Alexander Hamilton. While the bulk of the texts 
that the Center digitized were selected by Etext staff from the 
UVA library stacks, a type of pre-Google digitization project, 
individual scholars would also suggest targeted texts of interest, 
including many of the noncanonical texts, for digitization. Vir- 
ginia’s Etext Center was unusual in its digitization of noncanoni- 
cal texts as most other centers, according to Mandell and Gamer, 
were traditional in their approach: “the web—at least in its first 
stages—did reproduce canonical biases long inscribed in Roman- 
tic poetry. This was particularly true of ‘early’ electronic text col- 
lections, like those compiled at Oxford, Toronto, Berkeley, and 
Carnegie Mellon, to name a few."?? Those centers that produced 
noncanonical texts shared a common focus on faculty and gradu- 
ate student input in text selection. Scholarly participation, there- 
fore, was a central factor in the selection of texts digitized during 
this period. 

One successful recovery collaboration between an etext center, 
the University of Virginia, and an individual scholar, Jean Lee Cole, 
was the Winnifred Eaton Digital Archive (WEDA)" (see fig. 3.4). 

Immediately after publishing her 2002 book on Eaton, The 
Literary Voices of Winnifred Eaton: Redefining Ethnicity and Au- 


What’s In and What’s Out? + 75 


The Winnifred Eaton Digital Archive 


Loyola College in Maryland 


Hosted & Maintained by the Elcctronic Text Center 
University of Virginia Library 


1 Home | Browse | Search | 


About the archive 


Winnifred Eaton (1875-1954), writing under the pseudonym of Onoto Watanna, was the first person of Asian 
descent to publish a novel in the U.S. (Miss Numé of Japan, 1898; available in a reprint edition edited by Eve 
Oishi [Johns Hopkins UP, 1999]). Perhaps more significantly, she was the first Asian American to reach a 
national mainstream reading audience: between 1898 and 1925, she published over a dozen novels, most of 
them with Harper and Brothers, and dozens of short stories and non-fiction pieces, which appeared in mass- 
market periodicals such as Ladies' Home Journal, Frank Leslie's Popular Monthly, Century Magazine, and 
Harper's Monthly. Her second novel, A Japanese Nightingale (1901; available in a reprint edition edited by 
Jean Lee Cole and Maureen Honey [Rutgers UP, 2002]), sold hundreds of thousands of copies and was 
adapted for both Broadway and film. According to the testimony of surviving family members and Winnifred 
Eaton herself, the number of her periodical publications may have neared or surpassed a hundred works. 
However, scholars until now have only located twenty short stories and about a dozen non-fiction pieces; a 
number of these have been recently anthologized in The Half-Caste and Other Writings (2003), edited by 
Linda Trinh Moser and Elizabeth Rooney. 


None of the texts collected here appear in The Half-Caste and Other Writings or in any other anthology that 
includes Eaton's works. They thus greatly expand the known output of this important early figure in Asian 
American literature. The archive includes short stories, short fiction pieces, and complete novels, all of which 
appear in popular magazines of the period including American Home Journal (later renamed Conkey's Home 


Fig. 3.4. Winnifred Eaton Digital Archive, screenshot. 


thenticity (Rutgers University Press), Cole uncovered numerous 
“lost” Eaton stories and novels. Unable to secure a publisher for 
the little known author's work (publication of Eaton’s work was 
not commercially viable, according to the presses she contacted), 
Cole explored other options for publication. When she attended 
the Rare Book School at the University of Virginia, Cole was 
put in touch with the Etext Center to discuss the possible digi- 
tal publication of the materials! Cole created a digital archive 
housed on the UVA Etext Center servers that included an in- 
troduction to Eaton, twenty-three short stories, two novels, and 
thirteen nonfiction pieces. The WEDA archive is a particularly 
interesting example from this period as it is representative of the 
fluid boundaries between print and digital. The digital archive 
began as a way to add materials to the print monograph. Once 
the archive was complete, Cole recognized that scholarly credit 
would not be extended to her digital work, so she published a 
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critical article, checklist of materials, and a reprint of one short 
story in the journal Legacy. Cole's use of the digital as a means of 
increasing access to "lost" works is indicative of the period where 
the digital was seen as a tool for extending the canon. 

As we moved out of the 1990s, the flurry of digital recovery 
projects and the promise of activist work to restructure the canon 
began to fade. Digital recovery projects had made a huge impact 
on the texts available online, but such work seemed to contrast 
with the growing professionalization of digital literary studies. 
While digital editions and archives had coexisted with digital re- 
covery work in this earlier period, the increasing institutionaliza- 
tion of digital humanities shifted the focus away from such tech- 
nologically simplistic work and toward more robust and complex 
projects. The decline of digital recovery projects will produce 
what Kenneth Price calls a "newly emerging digital canon of 
American literature." Unfortunately, the canon we now have was 
created “partly by design and partly by chance,” a canon much 
more narrow than that of the contemporary print canon. 

The late 2000s marked a period of rapid decline in recovery 
work. By 2008, most of the etext centers including the Oxford 
Text Archive, the UVA Etext Center, CETH, and the Carnegie 
Mellon center had been shuttered or absorbed by their institu- 
tions' libraries, resulting in a rapid decrease of new digital re- 
covery projects.*? The proliferation of early activist projects by 
scholars interested in recovery dropped precipitately as literary 
scholarship shifted its focus away from canonicity (see fig. 3.5). 

JSTOR data provides a snapshot of the larger literary field's 
interest in particular subjects. Collecting data from journals, 
books, and primary sources, JSTOR data provides one context 
for understanding trends in scholarship. An analysis of JSTOR 
data from 1950 to 2014 reveals that scholarship focused on canon 
concerns peaked in 1999 and has been declining since that pe- 
riod.** It is not surprising that the same pattern of interest and 
decline is apparent in the digital record of textual recovery. 
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Fig. 3.5. JSTOR data, canon keyword search. Note that journal 
publications focused on canon peak in the late 1990s and drop through the 
current decade. 


In addition to a decrease in new projects, a large number of 
digital recovery projects produced during the heyday of such 
work have been or are in danger of being lost, of grave concern 
as it Is possible that we are creating a digital canon that is more 
representative of the literary canon of 1950 than that of the lit- 
erary canon of 2014. Analysis of Alan Liu's Voice of the Shuttle 
(VOS), a curated hyperlinked project that compiled a broad 
range of primary and secondary humanities materials from 1994 
to 2009, provides a means of measuring the loss of early digital 
recovery projects. Subjects range from the technology of writ- 
ing, politics and government, photography, dance, anthropology, 
and cultural studies among other categories. Links to primary 
texts are included in multiple categories, including Literature 
(in English), Literatures (Other than English), Minority Stud- 
ies, Cultural Studies, and Gender and Sexuality Studies. Because 
Liu designed the VOS as comprehensive and also relied on an 
early form of crowd sourcing, a network of scholars that con- 
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tributed links and updates, the site provides an excellent measure 
of the state of digital projects during the years the site was ac- 
tive. While the site has been static since 2009, the Wayback Ma- 
chine, part of The Internet Archive, has archived earlier versions 
of the project and related links that though incomplete provides 
one of the best tools for scholars exploring the state of humani- 
ties digital projects of the period.” Examination of digital texts 
curated by the VOS reveals that a tremendous number of early 
digital recovery projects have been lost. Not only are links dead 
but detailed searches of the web suggest that materials have been 
completely removed. For example, correspondence with Sharon 
Harris, creator of the Early American Women’s Website (EAWW), 
reveals that the site was removed in 2011. According to Harris, 
there are no available backup files.” The loss of the EAWW is a 
familiar occurrence. Examination of projects included in the VOS 
reveals a great deal of loss. For example, contemporary writer 
Maya Angelou is included in the Contemporary American Au- 
thors and African American Literature categories with a total of 
seven sites related to her work. Only two of the seven projects 
remain online. Of the five original projects, we are left with scant 
details that we might use to locate the original documents, such 
as original URL, author, site creator, and hosting university. This 
unfortunate loss is replicated time and time again, leaving the 
digital literary canon smaller with each loss. 

Tracking the production and subsequent loss of recovery 
projects reveals the haphazard history of the preservation of such 
projects. The Winnifred Eaton Digital Archive is one such example. 
Hosted by the University of Virginia's Etext Center as a separate 
and distinct digital archive from 2003 to 2010, the archive has 
been absorbed into the library’s digital collections by the depart- 
ment of Digital Curation Services at the University of Virginia, 
and, as part of the transition, most Etext Center materials have 
been cleaned and migrated to stable repositories.” All Eaton texts, 
then, are preserved and available through a standard search of 
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VIRGO, the University of Virginia Library database. The pres- 
ervation of individual recovery texts is rare and those involved in 
such work at the University of Virginia show great forethought. 
However, preservation decisions have created a new form, per- 
haps a new edition, of Eaton’s work rather than a copy of the 
original archive. UVA has broken the original digital archive into 
individual items, in effect mimicking the original print versions 
of Eaton’s work. In addition, the original archive includes images 
from magazines that accompanied the texts. These images are no 
longer available in the VIRGO versions.** Also removed from the 
current Eaton texts are files that include information regarding 
the production of the digital version of the text, such as original 
creator of the materials, conversion metadata, and editing his- 
tory.” The elimination of such material erases the bibliographi- 
cal markers of production that scholars are interested in viewing 
when they evaluate the reliability of a text. Such losses are exac- 
erbated by the dissolution of the archive itself. The stand-alone 
digital archive of Eaton’s works reveals the volume and breadth 
of her writing and gives weight to the importance of resituating 
Eaton within the American literary tradition. The transfer of the 
body of work into single entries obscures the largess of Eaton’s 
work, and Eaton’s legacy recedes into the database, leaving her 
recovered body of work obscured once more.*” 

The Winnifred Eaton Digital Archive is just one of many early 
digital recovery projects that are unstable. Some recovery projects 
contain texts that are unavailable as open source online materials 
but are found in subscription databases. Other projects have been 
fragmentally archived, as is the case with the Eaton materials. 
Still other projects have disappeared completely and are available 
only in the original print form from which they were recovered. 
All losses impact the shape of our current digital canon. Scholars 
spent years digitizing and recovering texts, providing open access 
versions of work by authors who had been understudied. Every 
text that has disappeared has erased crucial knowledge that will 
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now need to be recovered or, worse, undertaken once more. The 
loss of a digital presence of an often reproduced canonical author 
may not hurt the awareness and study of the canonical author, 
but the loss of the single digital instance of a recovered text by a 
little known author will, in effect, send the author back into the 
hidden archives that scholars have worked to expose. Addition- 
ally, such built projects provide important cultural and historical 
data that reveal understandings of criticism and culture at a par- 
ticular historical moment. It matters that scholars and fans chose 
to dedicate scholarly sites to certain authors and not others. The 
loss of digital recovery projects leaves gaping holes in our under- 
standing of late twentieth-century literary scholarship. Digital 
humanists are fond of talking about sustainability as a problem 
for current and future works, but it is clear that we already have 
sustained a good deal of loss within the broadly defined digital 
canon. 

While a good many of the early small-scale digital projects 
have been displaced or lost from our current digital canon, a few 
have managed not only to survive but to thrive and offer models 
of best practices for recovery and sustainability. Projects created 
in connection with an institutional structure, a university or etext 
center, have had a much better chance of surviving, in part because 
institutions understand standards and preservation approaches, 
have a greater possibility of staff dedicated to such work, and see 
the projects as part of past investments of the institution and, as 
such, economically and intellectually important. For example, the 
Victorian Women Writers Project was launched in 1995 and quickly 
became known as an exemplary digital recovery project. By mid- 
2000, however, the project was faded. The markup was old and in- 
complete, search capabilities were limited, and the project looked 
dated. In 2010 the Indiana University Library team, headed by 
Michelle Dalmau, refurbished the project.*! Texts were recoded 
to meet TEI Ps standards and additional materials, including 
"newly encoded texts and related contextual materials like criti- 
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cal introductions, biographical sketches, and annotations," were 
added. Harnessing the expertise of digital librarians at Indiana 
University, graduate and undergraduate students contributed 
to the revision.? The same support is usually not offered to the 
one-off, stand-alone DIY projects outlined at the beginning of 
this chapter, which suffer from their position outside the institu- 
tional structure. Other projects have found institutional support 
from digital organizations, such as NINES (Nineteenth-Century 
Scholarship Online). The project 19: Interdisciplinary Studies in 
the Nineteenth Century was initially formed as a simple HTML 
journal. Scholars affiliated with the project participated in a 2005 
NINES summer workshop during which they learned to encode 
their documents with the international standard of TEI/XML. 
Once the project was re-marked with TEI, it was brought into 
the NINES federated collection of nineteenth-century materials, 
which expanded its user base, allowed scholars to use different 
tools to manipulate and examine the materials, and increased the 
project chance of long-term sustainability. 

The attention to infrastructure is central to what might be the 
most successful early recovery project, the Women Writers Project. 
'The project is designed "to bring texts by pre-Victorian women 
writers out of the archive and make them accessible to a wide 
audience of teachers, students, scholars, and the general reader,” ® 
a goal shared by many of the early digital DIY projects. Begun 
in the department of English at Brown University, later moved 
to Computing and Information Services and, finally, the Brown 
library, the WWP has always demonstrated its connection to the 
mainstream of academia. The founders of the project, Susanne 
Woods and Stuart Curran, brought legitimacy to the project 
since they were known scholars in the subject area who could 
command respect for their work, even when that work was being 
disseminated through digital means. The advisory board and ini- 
tial editors were or have become leaders in their respective fields. 
In addition, the strategic alignment of the WWP with a highly 
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respected press, Oxford, and university, Brown, suggests that the 
effort to position the project as inside the academy not only aids 
legitimacy of the project but increases its ability to be sustained.* 
Cleverly tying the new with the old, Woods suggests that har- 
nessing traditional values and approaches to scholarship aid the 
sustainability of the WWP. As Woods describes the project, “The 
long-term future of the WWP, then, lies in its position as a bridge 
between old ways of reading and new."? Infrastructures of sup- 
port, then, are a necessity in digital project preservation. 
Infrastructure support extends beyond institutions to individ- 
uals who work in collective communities. Active communities are 
the difference between preservation and loss. Curated hyperlink 
projects, for example, are structured to take advantage of active 
communities that produce webpages. As communities lose inter- 
est, projects disappear leading to the death of links, a process that 
eventually renders the curated hyperlink project unusable. An ac- 
tive and interested community, however, is incredibly powerful 
as demonstrated by viral archiving examples. Viral archiving is 
the replication of a text for use or distribution. Perhaps the most 
studied text preserved through viral archiving is Aggripa (A Book 
of the Dead), a born digital art collaboration of Dennis Ashbaugh, 
William Gibson, and Kevin Begos Jr. Matthew Kirschenbaum 
and Alan Liu’s exemplary work on Aggripa reveals that “the text of 
the poem as it circulates online today is not a digital copy whose 
bits were lifted from one of the project's diskettes, but rather the 
result of a manual transcription of a video cassette . . "6 The 
hackers who chose to preserve the poem were responding to 
the unique challenge of a software text that was designed to be 
unarchivable, and though we don't have an exact replica of the 
original poem, a surrogate for study has remained through the 
communities' efforts. Similar efforts include the reproduction of 
a poignant letter written in 1795 by Judith Cocks, a slave in Con- 
necticut, to her master. Originally recovered by the Early Ameri- 
can Women Writers website," the letter was replicated in an entry 
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on the Women of Color in Accounting Facebook page.** The 
reproduction of the letter on a nonacademic page is not uncom- 
mon; “The Marriage of Okiku-San,” a short story found within 
the Winnifred Eaton Digital Archive and reprinted by Legacy, is 
preserved in various sites including a genealogy site and various 
Chinese and Japanese sites.? While viral archiving successfully 
preserves texts, the happenstance approach to such preservation 
is by no means an appropriate long-term strategy if we are inter- 
ested in producing a comprehensive humanities data set. 

Crucial to establishment of an inclusive set of digital literary 
texts is an understanding of the role of technological standards 
in preservation and canonization. Digital humanists are well 
aware of the importance of updating technologies for preserva- 
tion, and successful projects such as the Walt Whitman Archive 
have made the transition from HTML, to SGML, to TEI/XML 
metadata structures. However, technology standards have im- 
pacted the ways that materials are utilized, leaving some digital 
recovery projects unused and excluded from what is developing 
as technological standards that are elemental to the formation 
of what is and is not included in the digital literary canon. Ken- 
neth Price alludes to the centrality of technology standards in 
canon selection when he argues that *people ready to embrace 
high quality work wherever it is found hold in highest regard 
digital work that features a rigorous editorial process and adheres 
to international standards (for example, TEI/XML).”® The no- 
tion of technological standardization as a marker of high quality 
digital work provides another clue to the displacement of digi- 
tal recovery texts. An MLA bibliography search includes almost 
one thousand digital literary projects, yet the vast majority of 
the small-scale, older recovery projects included by MLA are in- 
visible in current digital literary scholarship. Glynis Carr’s The 
Online Archive of Nineteenth-Century U.S. Women’s Writing, active 
from 1997 to 2001, is one such project listed in the MLA bibli- 
ography. A number of the texts included on the site, such as Aunt 
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Lindy by former slave Victoria Earle Matthews, are not available 
in digital form elsewhere. Though Matthews’s text would be very 
useful for those examining nineteenth-century issues of religion 
and race in postbellum America, the text is not considered part of 
the current digital literary studies canon in large part because of 
its technological infrastructure. The project shares characteris- 
tics with other dying or lost digital recovery projects. Glynis Carr 
produced the project with a small number of students, without a 
digital humanities center or library support, and published the 
project on her personal staff page at the university at which she 
is employed. Carr is also not active within the digital humani- 
ties community. All of these reasons may contribute to the lack 
of attention given to the site, but perhaps most damaging is that 
the site was produced in HTML, a form resistant to preserva- 
tion strategies and noninteroperable with current datamining 
approaches. A common strategy in early digital recovery proj- 
ects, the use of simple technologies for digital project creation, 
now mark projects as “low quality,” a stigma that creators of the 
projects are well aware. Interviews with Jean Lee Cole, creator 
of the Winnifred Eaton Digital Archive, Sharon Harris, creator of 
the Early American Women's Website; and Donna Campbell, cre- 
ator of American Authors reveal that all three scholars believe that 
current technological standards have stymied project preserva- 
tion. Harris, for example, reports that she removed her EAWW 
project because it was “not in a sophisticated platform."?! Cole 
and Campbell both view their projects as excluded from digital 
literary scholarship primarily due to technological approaches. 
Some digital scholars see the overdetermination of technol- 
ogy as a disturbing trend that leads to a narrow digital litera- 
ture canon. Amanda Gailey and Andrew Jewell call this trend “a 
‘hipster ethos’” that makes “absent” “celebrations of content-rich 
digital humanities projects."? Gailey and Jewell are concerned 
with current production of digital editions, but the emphasis on 
technological standards has an equally damaging impact on the 
inclusion of a diverse canon. 
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Barriers to digital recovery projects are also found in the 
current ways that we theorize what is an appropriate "text" for 
digitization and analysis. Margaret Ezell cautions that we have 
not revised the way in which we understand texts and because of 
this elision certain texts, particularly noncanonical texts, are not 
being digitized. She argues that “while we increasingly have the 
ability to digitalize any text we please . . . editors do not please to 
select certain types of material and this is in part because perhaps 
we are not yet changing some of the basic assumptions about 
what an ‘edition’ does, or in Hunter’s terms, what is ‘appropri- 
ate.” Clearly the question of canon is still in flux, regardless of 
the belief that such concerns were resolved by the culture wars of 
the last century. We might also have a historical problem in the 
very emergence of digital humanities that contributes to the se- 
lection of materials for digitization. Martha Nell Smith contends 
that digital humanities developed as a space to which practitio- 
ners fled from the shifts in the profession that arose out of the 
cultural studies movement. In “The Human Touch: Software of 
the Highest Order, Revisiting Editing as Interpretation,” Smith 
highlights the digital humanities’ retreat into modes of analytics, 
objective approaches as “safe” alternatives to the messy fluidi- 
ties found in literary studies. She notes, “It was as if these mat- 
ters of objective and hard science provided an oasis for folks who 
did not want to clutter sharp, disciplined, methodical philosophy 
with considerations of the gender-, race-, and class-determined 
facts of life . . . Humanities computing seemed to offer a space 
free from all this messiness and a return to objective questions of 
representation.”** If Smith is correct, then it should be no sur- 
prise that the recovery of messy lost texts has not been a priority 
for the field. 

Additional reasons for the exclusion of particular texts are eco- 
nomic. Citing a number of press directors, John Willinsky points 
to the continued cost of scholarly publication due to the contin- 
ued need for press staff.” As Kenneth Price has so adroitly stated, 
digital materials might be free for those who use the project, “But 
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free stuff comes from somewhere, and it is rarely, if ever, free to 
produce.”* Price, who has successfully guided the Walt Whitman 
Arcbive through numerous grant applications, emphasizes that 
"the creation of digital editions is expensive, and the demand for 
external and internal grant support always exceeds the money 
available. We need to reflect, then, on what gets funded and what 
does not and to take care not to institute an even more narrowly 
conceived canon than in the past." NEH grants, which fund a 
majority of the digital literary projects, are often judged by im- 
pact and impact is most recognized by numbers of hits to the 
site. We know that canonical writers have a greater chance of 
a large following than little known writers. So, it follows that a 
good number of the archive and edition projects are focused on 
canonical writers, such as Rossetti, Melville, or Whitman. *What 
is marketable in terms of digitalization projects are . . . edito- 
rial projects covering great vast expanses of materials of varying 
natures," states Margaret Ezell, ^and that digitalization in this 
instances ‘sells’ because of its ability to include ‘everything’ and 
link it in a comprehensible searchable and sustainable system." 
This system tends to exclude outlying texts, including the “single, 
uniquely existing manuscript.”** Such structural concerns need 
to remain at the fore of our work with digitization, since without 
careful attention to issues of the canon, we risk creating a digital 
canon that looks more like a New Critical canon than a contem- 
porary, diverse body of literature. 

What I am arguing seems to move against current digital hu- 
manities trends regarding preservation. As Kathleen Fitzpatrick 
and Matthew Kirschenbaum, among others, have argued, digital 
forms are far more stable than most critics presume. Forensic 
computing is able to recover data etched on hard drives and viral 
preservation has preserved games, texts like Aggripa, and even 
some of the recovered texts this chapter has reviewed. However, 
viral preservation or even planned obsolescence are not appropri- 
ate for certain kinds of materials, such as recovered texts. While 
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we might bemoan the loss of Bethany Nowviskie’s early digital 
project, The John Keats Hypermedia Archive, we can easily access 
Keats materials in print or even online.” The majority of the 
early digital recovery work discussed in this chapter, however, is 
not easily accessible. Few of these texts are in print. A few more 
are available on for-profit databases or on microfilm, but most 
are available only with a return to the one or two libraries that 
own the original physical copy of the book, journal, or newspa- 
per. The targeted preservation of such materials is not technical, 
as Matthew Kirschenbaum has taught us, but social, and as a field 
we need to put such materials at the front of the preservation 
queue. 

Central to our efforts is the need to build a community in- 
vested in the consideration of the digital literary canon. We need 
an initial targeted triage focused on digital recovery projects that 
reproduce materials not accessible in print or open access digi- 
tal form. Successful models exist. We eventually need a thought- 
ful, targeted approach to recovery and preservation, but until we 
have resolved the larger digital preservation problem we might 
think in short-term increments. In “The Future of Preserving the 
Past,” Dan Cohen writes, “Worrying too much about the long- 
term fate of digital materials in many ways puts the cart ahead of 
the horse . . . Instead of worrying about long-term preservation, 
most of us should focus on acquiring the materials in jeopardy in 
the first place and on shorter-term preservation horizons, 5 to 10 
years, through well-known and effective techniques such as fre- 
quent backups stored in multiple locations and transferring files 
regularly to new storage media, such as from aging floppy discs 
to DVD-ROMs. If we do not have the artifacts to begin with, we 
will never be able to transfer them to one of the more permanent 
digital archives being created by the technologists.”® Cohen's 
suggested approach won't resolve our preservation problems, but 
he reminds us that simple, short-term solutions might provide a 
stopgap measure for the hemorrhaging of recovery projects. 
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Existing organizations, including digital humanities centers, 
collectives, and existing projects, have preserved recovery proj- 
ects. For example, the University of Nebraska's Center for Digital 
Research has provided support to projects that otherwise might 
have disappeared, such as Marta Werner's Radical Scatters. NINES 
and its sister organizations under the Advanced Resource Con- 
sortium (ARC) have established networks that foster and support 
various forms of digital work including recovery. Another avenue 
for increasing preservation is to incorporate legacy projects into 
current digital projects. Given the collaborative, extensible, in- 
teroperable nature of digital work, there is no reason why con- 
temporary projects might not be connected to legacy projects. 
In effect, communities of interest are best able to engage with 
discussions of and experiments to preserve crucial early digital 
recovery texts. The communities of interest must also engage in 
the shifting technological and economic requirements of digital 
text recovery. Consortiums might be built to target the quickly 
disappearing early digital recovery projects. Infrastructures 
might be proposed to preserve instances of the various projects. 
Such approaches are best built into existing societies or organiza- 
tions, particularly those organizations with an activist bent. For 
example, the College Language Association has long focused on 
African diaspora texts. Should the organization choose to act as a 
hub for recovery and preservation efforts, the effort would serve 
as an extension of work already underway. 

One of the powerful things about the early period of digi- 
tal literary studies is the DIY approach that many scholars em- 
braced, the sheer joy and freedom of bringing important texts to 
the larger scholarly community. As we move from simple HTML 
sites to TEI and visualization projects, as we move from indi- 
vidual or small collective projects to larger team projects, from 
nonbudgeted projects to large, externally funded projects, we see 
fewer scholars working with digital textual recovery. This should 
concern digital humanists, and we should, accordingly, begin to 
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strategize how we might reverse this trend. Small steps are un- 
derway. We need to examine the canon that we, as digital hu- 
manists, are constructing, a canon that skews toward traditional 
texts and excludes crucial work by women, people of color, and 
the GLBTQ community. We need to reinvigorate the spirit of 
previous scholars who believed that textual recovery was crucial 
to their work, who saw the digital as a way to enact changes in 
the canon. If, as Jerome McGann suggests, “the entirety of our 
cultural inheritance will be transformed and reedited in digital 
"5! then we must ensure that our representation of culture 
does not exclude work by those writers previously excluded. 


forms, 


CHAPTER 4 


Data and the Fragmented Text 


Tools, Visualization, and Datamining 
or Is Bigger Better? 


Personally, I think Digital Humanities is about building 
things. I'm willing to entertain highly expansive definitions 
of what it means to build something. I also think the disci- 
pline includes and should include people who theorize about 
building, people who design so that others might build, and 
those who supervise building (the coding question is, for me, 
a canard, insofar as many people build without knowing how 
to program). I'd even include people who are working to re- 
build systems like our present, irretrievably broken system 
of scholarly publishing. But if you are not making anything, 
you are not—in my less-than-three-minute opinion—a digi- 
tal humanist. You might be something else that is good and 
worthy—maybe you're a scholar of new media, or maybe a 
game theorist, or maybe a classicist with a blog (the latter be- 
ing very good thing indeed)— but if you aren't building, you 
are not engaged in the *methodologization" of the humani- 
ties, which, to me, is the hallmark of the discipline that was 
already decades old when I came to it. 


—Stephen Ramsay, ^Who's In and Who's Out,” History 
and Future of Digital Humanities Panel, MLA, 2011 


The first chapters of this book trace the literary approaches— 
textual studies, new historicism, and cultural criticism—that un- 
derlie contemporary digital literary scholarship. Sharing a the- 
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oretical terrain with traditional literary scholarship, the digital 
work I have traced in the first part of this book has been largely 
representational, with technology primarily used to create ideal- 
ized or better versions than would be possible in print. Current 
trends in digital literary studies, and the larger digital humani- 
ties, appear to be moving away from representational concerns 
and toward interpretive functions as contemporary digital schol- 
ars, such as Stephen Ramsay, Franco Moretti, Matthew Jockers, 
Geoffrey Rockwell, and others, are using technology to devolve, 
manipulate, and reform the literary text. Arguing that earlier 
digital literary studies materials “seldom are . . . transformed al- 
gorithmically as a means of gaining entry to the deliberately and 
self-consciously subjective act of critical interpretation,” these 
critics want to move the practice beyond what Ramsay calls “fact- 
checking” and instead employ technology to “assist the critic in 
the unfolding of interpretive possibilities.”! Though Matthew 
Jockers views computational textual analysis as “the foundation 
of digital humanities and its deepest root;" digital literary stud- 
ies has largely ignored computational approaches until recently. 
If the preceding years of digital literary studies have been marked 
by continuity with traditional methodologies and practices, cur- 
rent work signals a potential break from the past. Scholarly anal- 
ysis is being altered by algorithmic approaches that are beginning 
to produce evidence that might answer the long-standing digital 
humanities claim of presenting new findings through technologi- 
cal interventions, what might be called technological interpreta- 
tion or algorithmic interpretation.’ By focusing on tool develop- 
ment, visualization, and datamining, three crucial subareas of the 
interpretive bent of digital studies, this chapter will unravel how 
digital literary studies is beginning to shift from representation 
to interpretation. 

Challenges to representational applications of technology 
began in digital literary studies in the early 2000s. In his 2001 
volume, Radiant Textuality, Jerome McGann condemned the 
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primary methodologies of digital scholarship, *methods of sort- 
ing, accessing, and disseminating large bodies of materials, and 
on certain specialized problems in computational stylistics and 
linguistics,” and issued a prophetic challenge: “the general field 
of humanities education and scholarship will not take the use of digi- 
tal technology seriously until one demonstrates how its tools improve 
the ways we explore and explain aesthetic works—until, that is, they 
expand our interpretational procedures.”* McGann’s University of 
Virginia compatriot, John Unsworth, would expand McGann’s 
call for interpretation when, in 2003, he challenged scholars to 
“demonstrate the usefulness of all the stuff we have digitized over 
the last decade and more—and usefulness not just in the form of 
increased access, but specifically, in what we can do with the stuff 
once we get it: what new questions we could ask, what old ones 
we could answer." Unsworth’s phrase “what new questions we 
might ask” became the mantra of a subsection of the field, largely 
driven by tool production as the means by which scholars might 
answer Unsworth’s charge, ushering in what Ramsay calls an “age 
of tools.” At the 2006 MLA meeting, Stephen Ramsay? summed 
up the tools movement: 


Over the last twenty years, we have spent millions digitiz- 
ing texts and putting them online. The resulting digital 
full-text archives are among the greatest achievements in 
digital humanities. Yet for all their wonder, they remain 
committed to a vision of digital textuality firmly ensconced 
within the metaphor of the physical library. You can browse 
the text, read the text, search the text, and even download 
the text, but you can’t really do much beyond that. It is time 
to start thinking of ways to exploit this data with analyti- 
cal tools and visualizations. Ideally, such tools should be an 
integral part of the experience of working with Web-based 
text collections.’ 
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The emphasis on interpretive tools gained momentum and tool de- 
velopment and use became a focal point of digital literary studies. 
As interest in tools grew, scholars began to realize that match- 
ing technology to use was central to an expansion of technologi- 
cal interpretation. The 2005 Summit on Digital Tools for the 
Humanities issued a report that, while largely speculative, identi- 
fied the key concepts of interpretation, exploration of resources, 
collaboration, and visualization of time, space, and uncertainty 
that remain central to technological interpretation. Recognizing 
that “the use of digital tools in the humanities is, for the most 
part, still in its infancy,” the report calls for development “of strat- 
egies to aid scholars in the use or re-use of existing tools” rather 
than “the creation of wholly new tools that are specially designed 
for the humanities.”* The report was responding to the tension 
between what John Bradley views as two categories of tool use: 
those tools “developed specifically to support humanities schol- 
arship” and those “not developed by humanists” but “proven .. . 
to be powerful tools when applied to scholarly tasks."? Scholars 
interested in constructing tools to support digital work quickly 
found that tool development was expensive and difficult and 
often led to highly idiosyncratic, nonextensible, and unsustain- 
able tools.'? Tools would be developed that were effective in a 
particular project but were not interoperable with other digital 
texts. For example, Collex, the collection and exhibit builder tool 
developed to work with the NINES dataset, is a robust tool but 
difficult to extend to other projects, requiring customization of 
the database. Other tools, such as Wordhoard and Protovis, are 
no longer updated, leading to possible problems. Other projects, 
such as MONK, were given limited funding and are ultimately 
less useful than experimental. Each of these problems contributes 
to the creation of humanities software with high experimental 
impact but low long-term value. For this reason, the summit en- 
couraged scholars to find ways of using extensible tools rather 
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than creating idiosyncratic tools. Io do so, though, attention 
to disciplinary purpose must be central. Johanna Drucker cau- 
tions digital humanities scholars that “[t]he cultural authority of 
digital technology is still claimed by the fields that design the 
platforms and protocols on which we work."!! Arguing that tools 
that represent place and space are resistant to humanities’ ap- 
proaches, Drucker maintains that “[f]lexible metrics, variable, 
discontinuous, and multi-dimensional will be necessary to realize 
a humanistic system for the graphical analysis of temporal rela- 
tions.” The various problems raised by tool development are 
to be expected in this early, exploratory period. As the summit 
report notes, “When information technology is introduced into 
a discipline or some social activity there seem to be two stages. 
First, the technology is used to automate what users were already 
doing, but now doing it better, faster and possibly cheaper. In the 
second stage (which does not always occur), a revolution takes 
place."? We appear to be moving out of the first stage of technol- 
ogy automation to the proposed revolutionary shift in disciplin- 
ary practice. As we move into new uses of technology, however, 
Drucker’s call for theoretical engagement with technology must 
remain central. 

‘Tool development may be de rigueur in digital literary studies, 
but the purpose of such tool development tends to fall into two 
categories: representational and interpretative. Tools constructed 
for seemingly similar purposes, like the Versioning Machine and 
JUXTA, reveal crucial theoretical differences that mirror the ten- 
sion between representation and interpretation. Launched at the 
ALLC/ACH conference in 2002, Susan Schreibman’s Versioning 
Machine (VM) “was conceived as an exploration into how the 
computer as a medium and as a tool can represent text that exists 
in multiple versions.”'* Designed to display multiple versions of 
TEI/XML encoded texts, the VM is a tool to make digital editing 
easier (see fig. 4.1). 

The tool allows the display of complicated TEI/XML docu- 
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New Version e Bibliographic Info * Line Numbers: V e| Nonotes found * | * Index of texts 


"There are two Ripenings" 
by By Emily Dickinson. 
Original Source 
See Witness List. 


Witness List 


* Witness A456: A 456, version sent to 
unknown recipient. 
* Witness ATr60a: A Tr60a, version 


transcribed by Todd and sent to Anthon 
* Witness H47: H 47, Fascicle 14 


Textual Notes 


Render: Additions appear in a green, foxed-width 
font. 


Electronic Edition Information: 


Fig. 4.1. Versioning Machine 


ments that feature parallel segmentation for greater ease of ed- 
iting, making the VM a tool of great use to the production of 
the complicated digital editions highlighted in chapter 2. The 
Versioning Machine is knowingly representative and holistic 
“unlike, for example, other electronic display paradigms such as 
TextArc or TagClouds, whose goal is to provide new ways of vi- 
sualising the text through deformance by removing words from 
their syntactic contexts." Designed to represent a simulacrum 
of a text for analysis by a scholar, rather than to probe for mean- 
ing through technological intervention, the VM has the primary 
goal of allowing the user to more easily edit texts. JUX TA, on the 
other hand, is constructed to resist representation as it is focused 
on “procedures of interpretative deformation."!^ Like the Ver- 
sioning Machine, JUXTA compares versions of texts and allows 
scholars to to complete various tasks related to digital edition 
building including TEI/XML encoding." Unlike the Version- 
ing Machine, JUXTA is designed to work with a broader set of 
document forms, XML, HTML, DOC, RTF, PDF, and PUB. 
'The decision to allow the user to input multiple format types, 
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including unformatted RTF files, signals a very different under- 
standing of the role of technology. The Versioning Machine's 
display is built on the premise that users may only input TEI/ 
XML encoded documents, in effect suggesting that the VM will 
display the documents in the exact way that the editor interprets 
the document. With TEI, not only will the structure of the text 
be defined but the interpretation will be defined, highlighting the 
VM goal of allowing the editor control over the text. In opposi- 
tion to this approach, the ability to input less rigorously defined 
texts to JUX TA indicates the ceding of control to an interpretive 
use of technology, of allowing the program to turn the text into 
data to create multiple visualizations. Accordingly, JUXTA uti- 
lizes algorithmic functions to analyze, compare, and display the 
text in various ways including heat maps, side-by-side compari- 
sons, and histograms (see figs. 4.2—4.4). 

The tool is designed to create “interpretive comparisons” that 
the user might then examine to fulfill “the basic humanities inter- 
pretive act: critical comparison.”!* The imagistic understanding 
of textual difference, the histogram graphs and heat maps, are 
built by carving the texts into discreet data points and then dis- 
playing the points in a form that the user must then rebuild into a 
pattern. While both programs might be used to successfully edit 
a text, the use of technology and the theoretical understanding of 
a text remain crucially different in the tools. 

As tool use gained momentum and interpretive possibilities 
were revealed, digital humanities scholars began to argue that 
tool development, or building, might be understood as an intel- 
lectual act, an argument. Foremost in the calls for such an ap- 
proach has been Stephen Ramsay, who set off a firestorm when he 
announced at the 2011 MLA: “If you're not making anything . . . 
you're not a digital humanist.” The value of building, according 
to Ramsay, is interpretive by nature: 


DH-ers insist—again and again—that this process of cre- 
ation yields insights that are difficult to acquire otherwise. 
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Fig. 4.3. Side-by-Side Comparison 


Its the thing I've been hearing for as I long as I've been 
in this. People who mark up texts say it, as do those who 
build software, back social networks, create visualizations, 
and pursue the dozens of other forms of haptic engage- 
ment that bring DH-ers to the same table. Building is, for 
us, a new kind of hermeneutic—one that is quite a bit more 
radical than taking the traditional methods of humanistic 
inquiry and applying them to digital objects.’ 
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Fig. 4.4. Histogram 


In Ramsay’s conception of building, construction is a means of 
theorizing where creation of a technological object is a sustained 
intellectual argument in and of itself. l'here is no need for a writ- 
ten description or analysis. In “How a Prototype Argues,” Galey 
and Ruecker argue that "digital artifacts themselves—not just 
their surrogate project reports—should stand as peer-reviewable 
forms of research, worthy of professional credit and contestable 
as forms of argument." Correlating the intellectual process of 
building to writing, Ramsay and Rockwell note: “If the quality of 
the interventions that occur as a result of building are as interest- 
ing as those that are typically established through writing, then 
that activity is, for all intents and purposes, scholarship."?! The 
idea of tools as stand-alone artifacts representative of scholarly 
intent has met significant resistance in large part because the idea 
displaces the written analysis as the gold standard of scholarship. 
Stefan Sinclair states, “The tools banish the text, that with which 
the literary critic is most familiar. Even tools that offer functions 
to view the immediate context of words can hardly be satisfying: 
Snippets of a text are tethered out in arbitrary chunks usually 
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defined by character or word counts rather than natural textual 
boundaries such as paragraphs."? The replacement of the ho- 
listic text imbued with a materiality that represents the physical 
object with a fragmented object of study, in effect a concerted 
effort to break a text into discrete data points as is the case with, 
say, JUXTA, is a centerpiece of the digital as interpretation and 
what is often viewed as a direct challenge to traditional literary 
practice. 

Most nondigital literary scholars are resistant to viewing tools 
as theoretically constructed arguments, seeing such an under- 
standing of tools as antithetical to humanistic inquiry. When 
I ran a group session on data at the NEH sponsored NINES 
workshop on Evaluating Digital Scholarship, I asked participants 
if it was possible to assess tools as discrete scholarly artifacts. 
Most participants were uncomfortable with such an approach 
and instead argued that an article that discussed the results of the 
technological intervention was the only form of scholarship that 
might be evaluated. This is not an unsurprising response and one 
that has deep roots in disciplinary practice. Literary scholars are 
not trained to use algorithms nor to construct and apply tools to 
literary texts. Data representation and analysis is more closely 
associated with science and engineering within the specialized 
culture of academia than with literary analysis, and resistance to 
such work often arises out of tension between science and the 
humanities, a la C. P. Snow. As university funding and prestige 
is increasingly seen to correlate with S'TEM areas rather than 
humanities fields, humanities scholars have become increasingly 
suspect of what appear to be scientific approaches encroaching 
on traditional humanistic studies. As Stephen Ramsay remarks, 
“When we introduce computers into the field of interpretation, 
we bring along the inheritance of a popular conception that has 
always associated computers with the inexorable calculus of fact 
and truth."? As I have discussed earlier in this book, applied 
work, such as edition production, is not highly valued in con- 
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temporary literary studies. Tool development and use is often 
seen as application, and scholarship utilizing such approaches is 
seen as producing mechanized and simplistic treatments of com- 
plex cultural productions. The shift to recognizing building as a 
complement to traditional theoretical concerns is discomforting 
to many scholars, with scholars voicing apprehensions “that the 
methodology might overwhelm the humanities content, might, 
in fact, become the content.”** The concern that technology or 
tool might become a primary object of study, displacing the liter- 
ary text, threatens the centrality of long-held beliefs in the hu- 
manities. If we are to validate tool use we must, as Ramsay makes 
clear, provide "active resistance against the perception that we 
are out to provide scientific solutions to interpretive problems."? 
To do so necessitates careful interrogation of structures of tech- 
nological intervention. Christine Borman stresses that “humani- 
ties scholars need to be particularly attentive to unstated assump- 
tions about their data, sources of evidence, and epistemology. 
We are only beginning to understand what constitute data in the 
humanities, let alone how data differ from scholar to scholar and 
from author to reader." Should we treat “data as a matter of 
disciplines," full of interpretive possibilities, then we might en- 
gage with what Gitelman and Jackson term “the imagination of 
data." If we do not engage with an investigation of data within 
an interpretive framework, then we will have ceded meaning 
making to other disciplines and incompletely understood how to 
pair technology with use. 

Another important area of growth in interpretive analysis is 
visualization. Visualization of texts is not new to the humanities. 
Instances abound in print work, according to Geoffrey Rock- 
well, with examples including 1970s pictorial concordances, 
OED project visualizations, and texts that include plots and 
graphs, such as Bordeiu's Homo academicus?’ Current digital 
efforts, wordhoard, nora, TextArc, Monk, ‘Tapor, Voyant (Voy- 
eur), and Hyperpo, focus on “patterns in data—dynamic graphs, 
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charts, maps, plots, etc"? Unlike early models, which, in the 
most liberal sense, are representations of scholarly interpreta- 
tions, current experiments in visualization are representations 
for scholarly interpretation. The MONK (Metadata Offer New 
Knowledge) project, led by John Unsworth and Martin Muel- 
ler, is an experimental project that allows scholars to explore 
patterns in texts that will “produce new knowledge by exposing 
unanticipated similarities or differences, clustering or dispersal, 
co-occurrence and trends.”*° MONK provides “scientific visual- 
izations, where numeric data about the collections are presented 
in visual forms, and humanities visualizations, where text data 
are presented in visual ways that may or may not include ty- 
pography."?! Various visualizations produced by MONK include 
dialR, Repetition Graph, and Walls of Text. Each of these tools 
is premised on the idea that the computer is able to make inter- 
pretive interventions into a text based on the scholarly team’s 
expression of repetition through an algorithm. Put simply, the 
MONK team constructs an argument about word repetition, 
expresses that argument through an algorithm, and develops 
visualization tools that display the output of the algorithmic 
run for scholarly interpretation. MONK results are premised 
on a fragmented notion of text, a conversion of the seemingly 
whole text into data. However, the developers emphasize that 
while the materials are treated algorithmically as discrete datum, 
the tool’s purpose is equally invested in context: “Shuttling be- 
tween the ‘micro’ and the ‘macro’ is a distinctive feature of the 
MONK environment, where you may read as closely as you wish 
but can also practice many forms of what Franco Moretti has 
provocatively called ‘distant reading.’”*” Distant reading is a re- 
sponse to concerns that scholars have analyzed only a “minimal 
fraction of the literary field”: “a canon of two hundred novels, 
for instance, sounds very large for nineteenth-century Britain 
(and is much larger than the current one), but is still less than 
one per cent of the novels that were actually published: twenty 
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thousand, thirty, more, no one really knows—and close read- 
ing won't help here, a novel a day every day of the year would 
take a century or so.” To Moretti: “distance is however not an 
obstacle, but z specific form of knowledge: fewer elements, hence 
a sharper sense of their overall interconnection. Shapes, rela- 
tions, structures. Forms. Models.”** Moretti’s insistence on 
macro analysis raises the specter of mechanization for some crit- 
ics. For Katie Trumpener, Moretti "shows statistical analysis as 
a relatively blunt hermeneutic instrument, redeemed mainly by 
Moretti’s own exegetical verve.”** Trumpener goes on to argue 
that Moretti’s brilliant close readings are actually what make his 
distance readings work. Such fetishization of close reading, what 
Jane Gallop calls “the most valuable thing English ever had to 
offer” and “the thing that made up a discipline, that transformed 
us from cultured gentlemen into a profession,” has, it seems, led 
some to view data-driven approaches to textual interpretation 
with horror.” However, Moretti himself rejects the positioning 
of distance reading as oppositional to close reading: “quantita- 
tive series, for their part, allow us to see new problems, whose 
solution is usually found at the level of formal choices (linguistic, 
rhetorical, or a mix thereof). The specific relationship between 
literary form and nonliterary model varies from case to case; but 
the relationship is always there." By viewing close and distance 
reading as approaches that are fluidly selected by the critic for 
particular forms of criticism or in response to the dataset forma- 
tion, Moretti's technique offers an answer to Unsworth's charge 
of using technology to locate new questions. Instead of viewing 
the two approaches as binary, the techniques might be seen as 
selectively applicable. Timothy Burke agrees that one approach 
does not preclude the use of another: 


You could still easily argue that a cultural form which oc- 
cupied a miniscule slice of the total cultural activity in a 
given era was nevertheless the most powerful, influential or 
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hegemonic cultural form in that time or place, or that it was 
the key or linchpin of popular culture in that era. You can 
still say that certain kinds of exemplary or highly particu- 
lar works somehow best represent the spirit of a particular 
culture, or best problematize some of its characteristic in- 
ternal struggles and contradictions. You can still do close 
reading of a single text (as either a historian or a literary 
critic) and find it valuable. But both cultural history and 
historicist literary criticism could benefit enormously from 
a truly systematic, carefully quantified account of the total- 
ity of cultural work in any given moment and place.?? 


Shifting micro and macro level readings are an attempt to bal- 
ance two compelling forms of textual analysis: the traditional 
close reading of literary scholarship and new forms of computer 
driven interpretation examining patterns in a larger body of work. 

MONK, then, treats texts as data for manipulation purposes 
but, at the same time, provides an illusion of textual wholeness, of 
the ability to contextualize within the larger cohesive model. The 
emphasis on situating individual repetitions within the larger 
context is crucial to the success of the project. For example, the 
following repetition graph displays individual data points in ways 
that seem to reject a textual cohesion familiar to literary scholars 
(see figs. 4.5 and 4.6). 

Another repetition graph view, however, allows users to situ- 
ate the individual items within a context. 

MONK provides balance between individual data points and 
a contextualized view of the interpretive results, and the tool rep- 
resents an important experimental intervention in visualizations. 
What remains an issue to traditional literary scholars, however, 
is the initial interpretive function that treats text as data points. 
Human interpretation of texts is not easily transferred to algo- 
rithmic functions. Simple tasks, such as “hyphenated words at the 
end of a line or a page” “are major stumbling blocks in work- 


Fig. 4.5. Repetition graph, individual points. (Ruecker et al., *Visualizing 
Repetition in Text.”) 
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Fig. 4.6. Repetition graph, contextualized. (Ruecker et al., *Visualizing 
Repetition in Text.”) 
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flows that aim at creating a document space in which texts of 
different origins can be treated as members of a single corpus."?? 
While those involved with the creation of technological methods 
of interpretation recognize that such roadblocks will eventually 
be resolved, critics of the work see such problems as evidence 
of an intrinsic disconnect between humanistic interpretation and 
technology. 

To address the resistance to our work in visualization and tool 
development we must focus on expanding our experimental ap- 
proaches, sharing our results, and frankly discussing current limi- 
tations in the work. There are models of forward thinking exper- 
imentation, such as Tanya Clements investigation of Gertrude 
Stein's Tbe Making of Americans. Stein's frustrating text has long 
been difficult to analyze using traditional methods, with scholars 
claiming the text “is inchoate—the early work of an inexperienced 
author—and as such its constant repetition represents a style of 
writing that is chaotic, unsystematic, and virtually impossible to 
read."? MONKS' ability to visualize repetition, a motif in Stein’s 
work, makes it the perfect tool for use in the textual analysis, and 
Clements visualizations reveal that Stein groups frequencies of 
repetition by chapter (see fig. 4.7). 

With MONKS visualizations, Clement provides convincing 
evidence that the novel is carefully structured, and that the fre- 
quency groupings were structured to produce meaning, a find- 
ing antithetical to previous readings of Stein's work. Instead of 
viewing the text as fragmented into random pieces, “Each part 
is as important as the whole," a reading that is supported by the 
visualization. However, the literary scholar examining the article 
might remain unconvinced, as methodology and visualizations 
are partitioned from traditional scholarly analysis, reinforcing 
the belief in the incompatibility of algorithmic function within 
humanistic inquiry. This is not an entirely fair critique of the arti- 
cle, as the article’s publication in Literary and Linguistic Computing 
means that the audience skews toward technologically advanced 
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Fig. 4.7. Visualization of repetition over location (x-axis) and length (y- 
axis). 


readers. However, scholars involved in such experimental work 
are challenged to integrate what is often seen as the binary of 
algorithmic interpretation and humanistic analysis. Accordingly, 
if the tool does not easily reveal the underlying assumptions and 
approaches that drive the algorithmic interpretation, as is the 
case with MONK, then greater emphasis on the humanistic in- 
terpretation must be provided by the scholar to assuage concerns 
of traditional humanistic scholars. 

While we should applaud new readings of texts enabled by dis- 
tant reading, and Clement provides one of the best now available, 
we must also recognize that the tool through which the reading 
is created is not neutral and may impact the way in which one 
reads the texts. Just as literary critics recognize that cultural bi- 
ases might appear in their work, tool biases are also possible. This 


Data and the Fragmented Text + 107 


is not a problem with technology or tools. This is a deeper issue 
related to interpretation of all types, and rather than dismissing 
tools for such biases, we should, instead, recognize that tool bias 
might, at this moment, be more apparent than the naturalized 
biases in traditional literary interpretation. We must continue to 
critique machine-aided interpretation in the same way that we 
critique interpretation through traditional means; we must at- 
tend to the “human element” of not only the text but the data 
and the software. One of the most useful ways to begin this 
process is to facilitate exchange between data and tool, output 
and analysis. 

Critics that reject datamining often do so because they believe 
that algorithmic approaches are too generalized, simplistic, and 
unreflective of the complexity of human cultural artifacts. What 
they fail to realize is that current datamining limitations are in 
large part due to the poor data on which we perform algorith- 
mic criticism. For example, Trumpener complains about the lack 
of cross-cultural comparatives in Moretti's work on the British 
book title, which Moretti says is indeed true because the Brit- 
ish dataset was the only data available digitally for his manipula- 
tion. Laura Mandell has quipped, “we don't HAVE data yet."? By 
this, Mandell means that we don't have complete or broad sets 
of accurately digitized materials.? As scholars turned away from 
editing and early forms of digital textual representation to ex- 
perimental uses of technology for interpretation, the digitization 
of texts by scholars declined. For-profit companies like Gale or 
Google seemed to be digitizing our cultural heritage materials 
at a rapid clip, making small-scale scholarly digitization projects 
less attractive and seemingly without use. Unfortunately the texts 
produced by such groups contain gaps in digitization and search 
capabilities that limit their use. Mandell notes: 


scholars believe that the *Gale Group" has taken care of 
it. Gale produced ECCO, Eighteenth-Century Collections 
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Online, a dataset that seemingly single-handedly solves the 
problem of transferring 18th-century texts to digital media. 
There had been a microfilm project for capturing all the 
eighteenth-century texts listed in a renowned bibliography 
in the field—all 400,000 of them. It had begun in the late 
1970s, and libraries all over the Anglo-American world had 
participated, beginning with the British Library. Gale had 
taken over that microfilm collection and created digital 
image files out of 138,000 of the 200,000 that had been 
filmed. But image files aren't data. Having images online 
makes them easier to look at but not fundamentally dif- 
ferent from microfilm. Leaving those texts as image files is 
almost as good as burying them in the backyard.* 


Mandell’s focus on the limitations of searching is also related to 
use. Images are not available for algorithmic criticism without 
translation, and the translation engines, Optical Character Rec- 
ognition, or OCR, are problematic." OCR quality is variable for 
texts, with older texts presenting font problems. The long “s” in 
eighteenth-century texts, for example, is often misread by the 
OCR engine: “So somebody could search for ‘cases’ and look 
through the results and say, ‘Nobody ever sued anybody for di- 
vorce in the 18th century.’ And that wouldn't be true,” says Man- 
dell.“ Diana Kichuk agrees that conversion often produces poor 
data and points to the example of Early English Books Online's 
(EEBO) digital scans made from microfilm copies, which mud- 
dies the image capture due to the microfilm' poor quality. Add 
to this a good bit of *content amputation" and page distortion, 
and the final data becomes unreliable." Google Books has simi- 
lar technological problems including OCR and search limita- 
tions, well documented by critics including Geoffrey Nunberg, 
who has been outspoken in his critique of misnaming, misdating, 
and other metadata errors in Google Books. Unlike proprietary 
databases, such as EEBO or Eighteenth-Century Collections 
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Online (ECCO), there is not a scholarly selection process for 
what Google digitizes. Instead, Google is taking all materials 
found on its partners’ library shelves for conversion. Although 
we might argue that selection occurred when librarians chose to 
purchase or collect these materials, in many ways Google Books 
remains far less constrained than products like EBBO or ECCO. 
Often obscure materials are available from Google Books be- 
cause no one is deciding what goes in and what goes out, unlike 
is the case with for-profit digital datasets that are often severely 
distorted. A 2008 Library Journal review states that ECCO “in- 
cludes nearly every significant English-language title and edi- 
tion published between 1701 and 1800 in the UK, along with 
thousands of important works from the Americas—amounting 
to full-text searching of some 33 million pages of material.”* 
The touted comprehensiveness of EBBO, and other primary da- 
tasets like it, has been called into question by scholars. First, the 
materials included are often incomplete due to their provenance. 
For example, the Early English Books Online (EEBO) draws on 
the Early English Books microfilm facsimile collection, based on 
the Short Title Catalogue of English Books. While there have been 
efforts to extend the Short Title Catalogue of English Books, the 
genesis of EBBO, then, is in the 1970s. This leads to a skewing 
of the types of materials included in the collection due to shifts 
in canonical inclusion. EEBO is incomplete, with estimates that 
it contains “80 percent of the surviving print record in English 
from 1475-1700.” [f we utilize these materials for datamining, 
we will produce stilted results, exactly what critics of datamining 
charge. Instead of rejecting datamining altogether, however, we 
must instead be realistic about the types of data that we use for 
our work, making clear where limitations occur. The lack of ac- 
curacy of these datasets will necessarily be reflected in the types 
of results that we might produce when datamining. 

Those producing experimental datamining tools need to be 
forthright about how limitations of datasets impact their scholar- 
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ship. For example, the creators of MONK argue that, “For users 
of public domain materials, MONK provides quite good cover- 
age of rgth century American fiction, downloadable as TEI P-5 
files, with or without part-of-speech annotation, or available for 
exploration in the user interfaces developed by the MONK proj- 
ect."? Indeed, MONK does provide good coverage, but not good 
enough coverage for scholars of literature. MONK’ test set of data 
is drawn from several databases and digital projects, including 
Documenting the American South, Early American Fiction, EEBO, 
ECCO, 19th Century Fiction, Shakespeare, and Wright American 
Fiction, 1850-75. Of these, Documenting the American South, Early 
American Fiction and Wright American Fiction?! cover American 
texts. In comparing the seventh edition Norton American Litera- 
ture Antbology to the Americanist data available in MONK it is 
clear that the dataset is, at best, spotty. What is unexpected, per- 
haps, is that some later canon additions, such as Frederick Dou- 
glass, Fanny Fern, or Harriet Beecher Stowe, are available while 
canonical authors Emerson, Thoreau, and Jonathan Edwards are 
absent. There is a lack of coverage of Native American texts with 
now canonical writers such as Sarah Winnemucca and Zitkala Sa 
absent. Related materials, such as the captivity narrative of Mary 
Rowlandson, are also absent.? If we were interested in explor- 
ing representations of Native peoples in literature we would be 
neglecting crucial voices if we use this dataset. The issue of data 
inclusivity is not a problem with MONK, the tool. It is instead 
a systemic problem across the whole of humanities. Instead of 
claiming that datamining is able to definitively answer questions 
we must realize that we are in the infancy of algorithmic criti- 
cism. As the MONK project reveals, digital datasets “are large 
enough and rich enough to provide an excellent opportunity for 
text-mining, and we believe that web-based text-mining tools will 
make those collections significantly more useful, more informa- 
tive, and more rewarding for research and teaching." No doubt 
text mining will prove our datasets to be more useful, but "large 
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enough” and “rich” enough are dependent on the types of texts 
under consideration and the algorithmic approach applied to the 
dataset. Certainly sets that exclude particular groups of writers 
are not good enough. At this moment datamining is unable to 
produce accurate findings, and to resolve this problem we must 
continue to expand the data we are examining. The importance 
of good data—complete, accurate, and interoperable—will need 
continual effort over the next decade. 

Unfortunately in our excitement to champion the new algo- 
rithmic form we often overstate our results and downplay the 
issues with our datasets. For example, Matthew Wilkens has 
completed very interesting datamining work using the MONK 
tool, but problems with the dataset make his work a cautionary 
tale. Wilkens uses The Wright American Fiction dataset to exam- 
ine geographical locations mentioned in American literary texts, 
publishing his results in his American Literary History article “The 
Geographic Imagination of Civil War-Era American Fiction.”** 
While Wilkens’s article addresses a very interesting problem and 
is potentially quite revealing, his work is an example of what hap- 
pens when we do not attend to the datasets that we are using. The 
Wright American Fiction dataset was constructed by Lyle Wright 
in 1957 and revised in 1965. A librarian at the Huntington Li- 
brary, Wright inventoried the holdings of eighteen American li- 
braries.? James Harner summarizes the dataset: “A bibliography 
of American editions of separately published American fiction, 
including novels, romances, tall tales, allegories, and fictitious 
biographies and travels but excluding juvenile fiction, jestbooks, 
Indian captivity narratives, periodicals, annuals, gift books, folk- 
lore, tracts published by religious societies, dime novels, and sub- 
scription series.”*° Harner goes on to note that the dataset is “not 
comprehensive,” with the materials excluded from the database 
numerous. Specific novels by African American writers, such as 
Our Nig by Harriet Wilson, are missing and large categories of 
serialized novels and juvenile fiction are excluded. We should 
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also realize that American library holdings and library selection 
criteria would introduce bias to the dataset. The fact that this 
dataset was constructed prior to canon expansion in the 1970s 
and 1980s requires us to be careful about how we use the data. 
Unfortunately, these issues are not reflected in Wilkens's analy- 
sis. Wilkens writes, “There are some squarely canonical works 
included in this collection, including Moby-Dick and House of the 
Seven Gables, but the large majority are obscure novels by the 
likes of T. S. Arthur and Sylvanus Cobb.”*’ The canonical elisions 
need to be considered as part of the analysis, particularly since 
Wilkens is drawing conclusions about canon formation. Had 
Wilkens maintained that his results reflected an understanding of 
literature formed by scholars in the new critical era, the era that 
formed his dataset, his findings would have been relevant. His 
results do not reveal, however, a truth about American literature 
removed from a particular historical context because the dataset 
is limited by its construction. 

Data issues mar the effectiveness of otherwise superb tools. 
For example, the UC Berkley WordSeer tool analyzes grammati- 
cal strings, allows reading and annotation, heat maps, and word 
frequencies. WordSeer is a remarkably useful tool, but the data 
set used to test the tool and the conclusions drawn about the 
texts are problematic. The team chose to examine the slave nar- 
ratives from Documenting the American South, an early digitiza- 
tion project. They ran an analysis on the set to see if the literary 
conventions of the text corresponded with critic James Olney’s 
claim that autobiographical slave narratives include a set number 
of tropes, such as “I was born” or “Cruel slavemaster” or “barriers 
to literacy.”** However, the chosen data set was not appropriate 
for the research question. The three hundred narratives labeled 
slave narratives in Documenting the American South are actually a 
mixed bag of first person narratives that are fictional and nonfic- 
tional, black authored and white authored, pre- and post-Civil 
War, some autobiographical, some biographical, and some anti 
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and some pro slavery, or at least apologetic. Of the narratives that 
the team utilized, a few are authored without intervention, such 
as Frederick Douglass’s 1845 Narrative; others are authored with 
a good deal of intervention, perhaps even manipulation, such as 
The Confessions of Nat Turner; others have been proven false slave 
narratives, such as the Narrative of Archy Moore; and still others 
are colonization religious narratives, such as the Life of Jeduhi 
Ashmun, a narrative that convinced transcendentalist Elizabeth 
Peabody to support the antislavery cause. Given the narratives, 
it is hard to believe that Olney’s criteria, which he says are to be 
applied to black-authored autobiographies, would be possible to 
prove with the chosen test set. As Olney emphasizes, his criteria 
are indicative of a writer who is “not fictionalizing."?? Ultimately, 
WordSeer cannot be used to answer the research question posed 
because the dataset is not up to the challenge. Here I am chal- 
lenging those in the datamining community who dismiss canon 
as a “debate about space on the syllabus"^ or as less about the 
canon of data and more about *who has access to and can ma- 
nipulate the data."^' While I take their point that we do not have 
anything close to an inclusive canon, even that such a thing may 
never be realized, I think it is a mistake to dismiss the issue as 
something that has been resolved or might be sidestepped if we 
want big data to reveal the complexities of literature. It is crucial 
that we carefully examine the contours of a dataset and recognize 
what is revealed and what is hidden. There is still great value in 
the small, scholarly curated data set and we shouldn't think of 
such work as antithetical to large-scale projects, as the limitations 
of our datasets make small-scale datamining approaches far more 
accurate than those that take on a broader scope. If we want to 
be able to fully realize the possible benefits of datamining we will 
need a renewed effort in text digitization that occurs in tandem 
with experimental approaches in datamining, visualization, and 
geospatial representations. Recent critiques have tended to sepa- 
rate the two, a mistake that will damage both efforts. 


II4 * TRACES OF THE OLD, USES OF THE NEW 


What datamining offers most productively at this moment is 
the display of gaps and anomalies. Matthew Kirschenbaum ar- 
gues that where the algorithm, the visualization, or the dataset 
breaks down is a crucial point for scholarly engagement; “we’re 
interested in provocation, anomaly, and outlier results as much 
or more than in what we think the system actually gets right.” 
Such an approach represents an exciting way to understand the 
potential use value of interpretive approaches to technology. Ex- 
emplary work, such as Lauren Klein's “The Image of Absence: 
Archival Silence, Data Visualization, and James Hemings,” sug- 
gests how our experimentation with elision might offer new 
interpretations.? Klein examines the Papers of Thomas Jefferson 
Digital Edition (T7DE), a collection of letters, books, papers, and 
addresses, for what the “archival silences” might reveal about 
James Hemings, Jefferson's enslaved cook and brother to Sally 
Hemings.^ Noting the prevalence of archival silence in *the 
archive of the antebellum United States," and asking pertinent 
questions about how scholars might *identify and extract mean- 
ing from the documents in slavery’s archive,” Klein “demonstrates 
how a set of techniques that derive from the fields of computa- 
tional linguistics and data visualization help render visible the 
archival silences implicit in our understanding of chattel slavery 
today.”® A search for James Hemings’s name in the T7DE reveals 
no hits, reinforcing his absence from the archive. Through care- 
ful and targeted interpretive interventions, Klein is able to reveal 
a great deal of information about Hemings and the social and 
cultural networks he existed within. Through an analysis of in- 
dividual names mentioned in Jefferson’s letters,‘ Klein is able to 
prove that “the arcs that link Jefferson to the men and women he 
enslaved are much more prominent than those that link him to 
his family members and friends, indicating the degree to which 
Jefferson relied on his enslaved plantation staff to implement his 
various directives about such matters such as the purchase of sup- 
plies or the sale of goods"* (see fig. 4.8). 
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Fig. 4.8. Visualization of Jefferson's relation arcs. (Klein, 573.) 
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Klein's work provides a tantalizing solution to the limitations 
of data that we currently experience. While we must continue to 
digitize materials to expand our datasets for interpretation, Klein 
points to possibilities of using what we know to reveal what we 
don't know. As Amanda French insists in “In Praise of Humani- 
ties Data,” data does not have to be comprehensive to be useful.5 
However, we must be transparent about gaps in datasets, reveal- 
ing how we adjust our research methods to accumulate the miss- 
ing information that might skew our findings. 

The possibilities of technological interpretation of literary 
texts are myriad but still in their infancy. Instead of rejecting al- 
gorithmic approaches as flawed, we must focus on the construc- 
tion of data in tandem with experimental algorithmic manipula- 
tions. Literary datasets are clearly incomplete and dirty, but even 
within such data new interpretations might be successfully de- 
rived. On the other hand, we must not dismiss concerns about 
literary data. Algorithmic interpretation depends upon both the 
input and the output, and digital literary studies must actively en- 
gage with both. Instead of overstating the benefits of datamining, 
visualization, and tools, we might best praise such approaches’ 
ability to open materials to interpretive questions, what Ramsay 
rightly calls “the only kind of questions that really matter in the 


humanities."^ 


CHAPTER 5 


Notes on the Future of 
Digital Literary Studies 


As the worldview of code assumes comparable importance to 
the worldviews of speech and writing, the problematics of in- 
teraction between them grow more complex and entangled. 


— Katherine Hayles, My Mother Was a Computer: Digital 
Subjects and Literary Texts, (Chicago: U Chicago P, 
2010), 31 


The rhetoric of these statements (which could easily be 
multiplied) is not one of reform, but of revolution. As Mark 
Sample puts it, “It’s all about innovation and disruption. The 
digital humanities is really an insurgent humanities.” The 
project is insurgent in relation, first, to the present exclu- 
sionary structures of access and accreditation and, second, to 
the hegemony of global capitalism of which those structures 
are an extension. Digital humanities, declares the Manifesto, 
“have a utopian core shaped by its genealogical descent from 
the counterculture-cyberculture of the ’60s and ’7os. This is 
why it affirms the value of the open, the infinite, the expansive 
[and] the democratization of culture and scholarship." 


—Stanley Fish, “The Digital Humanities and the 
‘Transcending of Mortality,” Opinionator, The New York 
Times, January 9, 2012 


This volume is designed to explore the ways in which digital lit- 
erary scholarship has developed in the last twenty years, but it 
is not an answer to the perennial question “what are the digital 
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humanities?" 'Ioo much of the current work on digital scholar- 
ship has tried to define the field or practice within a contempo- 
rary framework, often insisting that the digital humanities are a 
revolutionary force to challenge all that comes before. In reality, 
the digital humanities owes a debt to a number of theoretical 
strains as outlined in this volume, and its ability to alter struc- 
tures, whether theoretical or institutional, is always already con- 
tained by the very infrastructure in which it currently exists, the 
American academy. To argue that digital humanities are a brand 
new construction seems naive at best, destructive at worst, as it 
creates an inaccurate binary that leads to increasingly hysterical 
readings of the digital humanities as usurper. As I revised this 
chapter, Adam Kirsch’s “Technology Is Taking Over English De- 
partments: The False Promise of the Digital Humanities” made 
waves within the academic community with charges of the *obvi- 
ously anti-humanistic manifestations of digital humanities.”! As 
this project reveals, various humanistic traditions, theories, and 
practices do indeed construct the subfield of digital literary criti- 
cism, and the charge of antihumanism ignores a long history of 
borrowing and modifying from the traditional humanities prac- 
tices, from the fields of textual studies, literary criticism, and cul- 
tural criticism. In this book, I separate what Matthew Kirschen- 
baum calls the “construct of a ‘digital humanities" from the 
practice of digital humanities by scholars of literature.” Instead of 
focusing on the institutional use of digital humanities or digital 
humanities as a (hopeful/destructive) political term, I am much 
more interested in the work of digital humanities as it has been 
and will be enacted. At the same time, this project recognizes that 
the future of digital literary studies is yet unmapped and various 
institutional, political, and economic forces may alter how the 
field develops in yet unknown ways. To this end, the project has 
focused on how current digital literary practice understands its 
relationship to what has come before and what will follow. 
Accordingly, this book has resisted the notion of the digital 
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humanities as “big tent,” a field or practice or theory that is broad, 
expansive, and inclusive, instead insisting on specificity as a coun- 
ter narrative to that of the sweeping representation of revolu- 
tionary change. While some may see the segmentation of digital 
humanities as counterproductive, I argue that digital humanities 
must be particularized because DH, as enacted, is so broad, dif- 
fuse, and flexible that a generalized definition does not adequately 
address the various digital approaches currently in use nor how 
certain humanities fields are being altered by digital practice. As 
‘Tom Scheinfeldt notes, “I believe an examination of our differ- 
ent disciplinary histories will advance even our interdisciplinary 
purposes: understanding what makes us distinctive will help us 
better see what in our practices may be of use to our colleagues in 
other disciplines and to see more clearly what they have to offer 
us." While I agree with Scheinfeldt that we do need to develop an 
understanding of multiple histories, I want to resist seeing mul- 
tidisciplinarity as merely a catalog of items that other disciplines 
might offer to digital practice, as such an approach suggests coop- 
tation and exploitation. A far more productive understanding of 
our collective histories is to identify the borders of practice and to 
look for disciplinary overlaps that benefit all partners. 

Digital literary studies, as I have defined the field in this vol- 
ume, is constructed from textual studies, new historicist theory, 
cultural studies theory, and computational applications, each of 
which compose the theoretical framework for our current prac- 
tice. Initial digital edition production utilized book technologies 
to become the first visible form in digital literary studies, and dig- 
ital edition production—from the recently launched Scholarly Ed- 
iting, which features "small" editions, to the digital conversion of 
long-standing print editorial projects, such as the DigitalDonne: 
The Online Variorum—remains a cornerstone of contemporary 
scholarship. Textual studies and book history have given us theo- 
retical structures and methodologies by which we might under- 
stand technologies of production, approaches we might borrow 
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from print production and apply to digital production, with ex- 
emplary projects like Matthew Kirschenbaum’s Mechanisms: New 
Media and the Forensic Imagination revealing the continuity be- 
tween the seemingly disparate areas. At the same time, we have 
also inherited the unfortunate representation of textual studies as 
mechanistic and antitheoretical. Primarily associated with edit- 
ing as an application, digital humanities insistence on more hack, 
less yack has been interpreted as a resistance to theoretical en- 
gagement in favor of application and left us open to charges of 
“DH as a refuge from theory.”* Regardless of a clear tradition of 
theoretical engagement within textual studies and the practice of 
editing, some, like Richard Grusin, condemn digital humanists 
of creating an “invidious distinction between making things and 
merely critiquing them. As we move forward, it is important 
to reveal how critiques of digital humanities as antitheoretical 
are misreading what are actually deep theoretical roots that are 
derived from textual studies. 

New historicism has taught us how to situate our object of 
study within a social structure, whether the object of study is 
a computer program and its interaction with the user, a digital 
object within a larger network, or digital humanities within the 
larger context of literary studies. We might locate the digital hu- 
manist fascination with institutional power structures as a crucial 
legacy sprung from new historicism. As new historicists exam- 
ined the complexities of power in the historical moment, current 
digital humanists often examine power structures within the aca- 
demic institution. Unfortunately, the digital humanists' critique 
of institutional structures is often seen as an attack on the hu- 
manities. For example, many essays in the special issue of differ- 
ences, “In the Shadows of the Digital Humanities,” charge current 
digital humanists with being complicit in the growing critique 
against the humanities and the larger American higher educa- 
tional system. David Golumbia, for example, argues that *[w]ere 
they more concerned with the problems of ideology and more 
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conscious of its tenacity, DHers might see how uncomfortably 
close the doctrine they advocate is to many of the most extreme 
ideological attacks on higher education mounted the world over 
by the political right." Grusin also asserts “that it is no coinci- 
dence that the digital humanities has emerged as 'the next big 
thing’ at the same moment that the neoliberalization and cor- 
poratization of higher education has intensified in the first de- 
cades of the 2 1st century." Such critiques are oddly reminiscent 
of charges levied against new historicism during the height of its 
popularity. Lee Patterson, for example, accused new historicism 
of “unintended conservatism.”* Donald Pease charged new his- 
toricism with “linguistic colonialism."? These charges against new 
historicism are strangely familiar, echoing in much of the most 
charged condemnations of digital humanities. Walter Benn Mi- 
chaels noted of anti-new historicist sentiment: *The debate over 
resistance and complicity is just a rerun of the old debate over 
the possibility of truly radical political change, a rerun made pos- 
sible by the historicist appropriate of an essentially deconstruc- 
tive model of political difference. Difference in deconstruction 
is crucially subversive, which is to say—translated to the level of 
culture—that differences within a culture must be understood as 
the difference of the culture from itself.” The digital humanities 
insistence on difference, difference in approach to scholarship, in 
types of methodologies, in the way that we do scholarship, is an 
active form of resistance to traditional academic hierarchies that 
many working in digital humanities find limiting. The critique of 
the humanities is not, as Golumbia suggests, an alignment with 
cyberlibertarianism but rather an attack on what many digital hu- 
manists see as academia’s rigid hierarchy, the academy insistence 
on practices that are, at heart, antidemocratic, antimeritocratic, 
and exclusionary. 

Despite the increasing centrality of digital humanities to many 
fields and to literature in particular, digital scholars continue to 
insist that they retain an outsider status in large part because they 
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want to see themselves as challenging exclusionary institutional 
practices. For example, blog writer bitnetted denies that DH 
scholars exert institutional power; *On the whole, DH people are 
not as structurally empowered within the academy as the theory 
leaders of the 80s and gos were. That there are some senior ‘names’ 
in the field is great, but the field itself is still heterogenous [s/c] and 
developing. As was pointed out in several sessions, many DH peo- 
ple are grad students, non T T, or staff. Those who are attempting 
to do such work from the TT are stressed about whether any of 
it will count towards T'&P, especially if they are of the making or 
coding variety of digital humanists.”'! Lisa Spiro jokes that DHers 
as “in-crowd” is “an ironic label for a group of people who have 
long felt like misfits."'? Others, such as Stefan Sinclair, argue that 
“so-called stars" are not really stars because they “are truly among 
the most humble and down-to-earth colleagues I can imagine. 
These are people who love doing their work and who spend an 
unbelievable amount of their time contributing altruistically to 
the community. These are people who volunteer huge amounts 
of time working behind the scenes on committees, advocating for 
the digital humanities at various levels, helping to provide support 
and expertise for other colleagues, mentoring junior colleagues 
formally and informally, and the list goes on."? The resistance 
to seeing digital humanists as those who have some sort of ac- 
ademic star quality or power in the profession has become the 
norm, ironic in the face of the increasing opportunities for those 
who are visible digital humanities scholars. Regardless of insis- 
tence by digital humanists that they are outsiders, it is abundantly 
clear that digital humanities has begun to exert power within tra- 
ditional academic structures. Perhaps the best articulation of the 
way in which power and privilege works within the system comes 
from Kirschenbaum, who argues that while certain forms of tech- 
nology can act as a “democratizer: the individual with a 4/4 load 
at an isolated teaching institution can wield influence in ways that 
would have been unthinkable in the theory-era,” DH stardom and 
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power “is not any less divorced from the real world balance of 
academic power, which (still) manifests in the form of jobs, grants, 
publications, invitations and all the rest of the apparatus.”'* We 
need critical attention to the fluid relationship of power struc- 
tures and their relation to academic infrastructure, and we need 
DH scholars to be sensitive to how such power dynamics replicate 
privilege. 

Current critiques couched as a battle between insiders and 
outsiders are unnuanced and potentially destructive forms of re- 
sistance to what could be productive dissention. For some, the 
resistance to critique comes because of what seems to be a lack of 
understanding about positionality. Roger Whitson, for example, 
says “movements like #transformDH (a group that has criticized 
digital humanities for its lack of attention to race, ethnicity, gen- 
der, class and sexuality) baffle me . . . if my experience with the 
MLA is any indication, the digital humanities doesn’t need to 
be changed"? Whitson’s dismissal of #transformDH’s response 
to MLA lacks a critical positionality that replicates exclusionary 
practices. Much like those who continue to insist on the outsider 
status of DH, Whitson doesn’t recognize when he slips into a 
position of power or at least comfort from which others might 
continue to be excluded. Such concerns have also turned on the 
specter of race that continues to overlay discussions of who is in 
and who is out in digital humanities in increasingly uncomfort- 
able, problematic ways. Reverberations of race thread through 
digital humanities writing, including Jean Bauer’s “Who You 
Calling Untheoretical.” Written in response to Natalia Cecire’s 
blog “When DH Was in Vogue; or, THATCamp Theory,” Bau- 
er’s use of the phrase “who you calling” connects linguistically to 
accusations of racial impurity. As early as the nineteenth century, 
Southern apologist and plantation myth author Thomas Nelson 
Page used “Who you callin’ nigger” in his work In Ole Virginia. 
Page recirculates the phrase in Unc’ Edinburg and, throughout 
the nineteenth and twentieth centuries, the skewed grammati- 
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cal construction was used when charging a character with racial 
taint. The lack of critical awareness of the problematic historical 
legacy invoked by the phrase is particularly poignant when we 
recognize that the blog is a response to Cecire's conscious play on 
“When the Negro was in Vogue,” a Langston Hughes essay criti- 
cal of whites who frequented Harlem during the jazz age to view 
the exotic spectacle of blackness. The connection of impurity to 
those interested in shifting digital humanities work to discussions 
of race is by no means intentional racism but a product of a long 
cultural use of race to represent contagion. We should not be 
surprised to see such a treatment of race show up within such 
discussions, as the American academy is, at heart, connected to 
deep inequities based on race and class. Instead of attacking the 
individuals who have utilized such analogies, we need to embrace 
the chance to have engaged conversations that strengthen the 
work that we conduct. 

In fact, the embrace of cultural studies criticism and what the 
field has taught us about power dynamics models must be in- 
tegrated into digital humanities if we truly want revolutionary 
shifts in academic culture. If we imagine, as Alan Liu suggests, 
digital humanities as a “service” “advocating the humanities in 
its present moment of need,”'® then we might begin to envisage 
how to deconstruct power structures in the institute in which 
digital humanities resides, the university. In February 2010, Wil- 
lard McCarty posted an analysis of a Yale graduate student con- 
ference, “The Past’s Digital Presence,” to the Humanist listserv. 
Quoting Ed Ayers, McCarty agreed that the conference might 
prove “a watershed event in the history of our field in the U.S.””” 
McCarty’s comment drew numerous responses to Humanist. 
Some questioned if we should celebrate such initial explorations 
just because they were happening at an Ivy when scholars on the 
ground at places like Maryland, Virginia, Nebraska, and Ken- 
tucky, among others, have been working with digital humanities 
for years, often at great expense to their own careers. As Amanda 
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Gailey eloquently argues, “I view the late arrival of the Ivies as 
a worrisome indicator that DH will soon be locked down by the 
same tired socio-economic gatekeeping mechanisms that pre- 
vent many people with talent from succeeding at so many other 
academic disciplines . . . [I]t is quite possible that a hitherto un- 
proven field, within which smart people not housed at the most 
selective and expensive universities could actually earn influence 
and rewards, is becoming less egalitarian.”!* Gailey's response to 
the conference is one which is worth taking seriously. If DH is 
going to effect change, then we must continually evaluate power 
dynamics. If we do not, then DH will be subsumed into the larger 
academic culture and the revolutionary impulse will be denied. 

Even detractors of digital humanities recognize that technol- 
ogy has already made major changes in the study of humanistic 
thought and production of scholarly work. Production of the ob- 
jects of study is increasingly digital, and clearly the mechanisms of 
our scholarly output, the presses that produce, say, monographs, 
are already responding to technological change. While we may 
bring our traditional methodologies and theories to bear on tech- 
nologically produced humanities materials, we will also want to 
respond, in the way humanists have always responded, to the shift- 
ing landscape of cultural production. Much of what we are now 
able to do with algorithmic interpretation has never before been 
possible, yet there is much experimentation yet to occur. Instead of 
clinging to our old methods, it seems that humanists would be best 
served by experimental uses of new tools to expand the humanistic 
inquiry in which we have long been engaged. 

Because the digital humanities continue to have such a long 
history of interaction with and derivation from recognizable hu- 
manistic models, it is surprising that DH is often seen as threat- 
ening and antihumanistic. It seems likely that the threat posed by 
DH stems from the institutional, structural changes that digital 
humanists advocate. Kirschenbaum highlights the ways by which 
digital humanists work differently than traditional humanists: 
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*. . . the digital humanities today is about a scholarship (and a 
pedagogy) that is publicly visible in ways to which we are gener- 
ally unaccustomed, a scholarship and pedagogy that's bound up 
with infrastructure in ways that are deeper and more explicit than 
we are generally accustomed, a scholarship and pedagogy that is 
collaborative and depends on networks of people and that lives an 
active, 24/7 life online." Kirschenbaum closes his essay by asking: 
“TIsn’t that something you want in your English department?” 
But the very issues he points to, the very issues that make DH 
unique within the humanities—collaboration, real time schol- 
arship, open access, restructuring of academic hierarchies—are 
exactly the structural and infrastructural points of tension with 
traditional humanists, the point of resistance for many in English 
departments. For many scholars, the more public the publication, 
the more suspicious the scholarly rigor of the work. Public blogs 
are accused of being CV filler rather than seen as disseminating 
our work to a broader public or, worse, selling out to a neolib- 
eral plot of assigning use value to our scholarly production. As 
digital humanities makes inroads into traditional departments, 
conferences, journals, and fields, it becomes more and more ap- 
parent that the thing that is threatening about digital humanities 
work is not the work itself but how digital humanists choose to 
work. Creation of a digital text makes sense within a disciplinary 
framework but working in collaborative teams to produce multi- 
authored scholarship is a practice far afield from how most liter- 
ary scholars work.? My participation in the 2011-2012 NINES/ 
NEH Summer Institute on Evaluating Digital Scholarship made 
clear just where the tensions in the new working structures ap- 
peared. For participants in the institute, departmental and uni- 
versity administrators, forms that looked like traditional forms, 
such as digital journals or even articles and books analyzing digi- 
tal tools or approaches, were acceptable modes of scholarship. 
The MLA likewise agrees that an electronic journal is “a viable 
and credible mode of scholarly production."?! Of far greater con- 
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troversy are the types of work that don't have direct analogues to 
what has been traditionally understood to be humanities schol- 
arship. Digital humanities scholars, such as Bethany Nowviskie, 
have argued vigorously against the move to “evaluate the prod- 
ucts of digital scholarship as if they can be mapped neatly to 
unary objects and established categories, such as journal articles 
or monographs,””’ but it is clear that many traditional humani- 
ties scholars continue to look for print analogues to digital ob- 
jects. Equally problematic is the suggestion that digital projects 
are best valued by the publication of an essay or book discuss- 
ing the project, another way to bypass a disciplinary culture shift 
in assessment. Resistance to valuing the tool, database, or code 
as scholarship in and of itself puts an extra burden on scholars 
working with digital production. 

In 1989, R. G. Potter called for a revision of literary studies; 
“What we need is a principled use of technology and criticism 
to form a new kind of literary study absolutely comfortable with 
scientific methods yet completely suffused with the values of the 
humanities."? We need to work together, in the shared spaces, 
to develop working models that best articulate our hopes. If we 
do indeed believe in digital humanities as transformative then 
we must continue to excavate and to rebuild the structures that 
underpin our work and our community. Our scholarly work at 
the intersections of technology and humanities is important, but 
itis our work that challenges power structures that is crucial. DH 
might shift exclusionary practices that have long run roughshod 
over the best impulses of academia, but to shift practices we can- 
not become what we battle. 
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